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SCENE  ANALYSIS:  A  SURVEY 


Presented  in  this  appendix  is  a  survey  which  traces  the  development  of  scene  analysis 
by  computer  from  its  origins  in  digitized  picture  processing  and  pattern  recognition, 
and  discusses  geometric  concepts  related  to  projection.  New  approaches  based  on 
projective  geometry  and  neurophysical  models  are  suggested. 
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CHAPTER  1 


INTRODUCTION 


Computers  are  often  considered  extensions  of  the  mind  in 
the  same  sense  that  manual  tools  are  extensions  of  the  hand. 

In  this  machine-roan  analogy,  physical  measuring  devices 
(photocells,  pressure  sensitive  switches,  etc.)  are  the 
sensory  cells  of  organs  of  sensation  and  the  computer  receiv¬ 
ing  messages  frem  them  functions  as  an  organ  of  perception. 

In  many  applications  of  the  analogy,  e.g.  process  control, 
the  computer  exhibits  mere  sensation  and  reflexes.  In 
artificial  intelligence  (henceforth  AI) ,  however,  computer 
systems  are  designed  expressly  to  exhibit  perception,  comprehen¬ 
sion,  curiosity,  and  intention.  Though  rather  anthropomorphic, 
this  aim  is  a  primary  motivation  for  most  AI  researchers  and 
raises  many  important  philosophical  and  psychological  questions 
whose  discussion  has  already  changed  our  way  of  looking  at 
human  intelligence.  Naive  popular  interpretations  of  AI 
include  conjurations  of  evil  or  benevolent  robots  or,  no  less 
anthropomorphic,  the  amoral  oracle  which  answers  questions 
truthfully  but  refuses  ("stubbornly,  proudly")  to  consider 
the  good  or  evil  consequences.  The  escalating  impact  of 
computers  on  society  elevates  the  importance  of  philosophical 
questions  about  AI  from  resolving  esoteric  problems  to  making 
decisions  about  social  policy.  The  reader  is  directed  to  an 
eloquent  examination  of  the  philosophical  questions  by  Turing 
(BIBGEN  1947^*  and  an  investigation  of  the  social  impact  of 
AI  by  Firschein  et  al.  (BIBGEN  1973) .  Such  philosophical 
questions  are  far  beyond  the  level  of  this  survey.  However, 
the  psychological  analogy  is  useful  in  introducing  a  unified 
analysis  of  artificial  visual  systems  whose  characteristics 
would  otherwise  appear  quite  diverse.  Since  artificial  visual 
systems  are  often  aimed  at  imitating,  improving  on,  or 
generalizing  from  natural  visual  systems,  the  analogy  is  not 
as  far-fetched  or  poetic  as  it  might  seem. 

*  See  Chapter  4  for  key  to  bibliographic  references  in  the  text. 


For  the  purposes  of  this  paper  I  will  define  scene  analysis 
as  the  computer  processing  of  two-dimensional  projected  images 
of  three-dimensional  scenes,  usually  typical  of  what  humans  see 
in  everyday  life,  to  yield  a  data  structure  which  somehow 
captures  the  individual  identity  of  and  spatial  relations 
between,  objects  in  the  3-D  world.  Scene  analysis  is  then 
computer  visual  perception  and  comprehension.  This  rather 
narrow  definition  deliberately  excludes  holography  which, 
though  it  can  give  an  accurate  reversible  (i.e.  yielding  original 
3-D  surface),  representation  of  three  dimensional  objects, 
treats  the  visual  world  as  a  mathematical  surface  devoid  of 
meaning.  I  also  exclude  picture  processing  which  though  often 
applied  to  enhance  images  of  objects  to  be  treated  by  human 
or  machine  viewers  as  three  dimensional,  (e.g.  stereophotos  in 
cartography)  has  not  until  recently  addressed  itself  directly 
to  the  data  structure  mentioned  in  this  paragraph's  opening 
sentence.  The  two  preceding  areas  are  not  meant  to  be  slighted 
by  omission;  they  are  among  the  most  fruitful  application  areas 
of  computer  processing  of  pictorial  information.  They  are 
certainly  in  a  more  advanced  state  of  application  than  scene 
analysis.  Their  strong  relation  to  scene  analysis  and  the 
area  of  overlap  will  be  discussed.  In  this  survey,  I  restrict 
the  scope  more  to  semantic,  relational  aspects  involved  in 
the  representation  of  objects  rather  than  the  purely  numerical, 
mathematical  surface  defined  by  the  locus  of  surfaces  intersected 
by  the  line  of  sight.  "Semantics"  can  mean  a  number  of  things, 
so  let  me  exclude  also  the  most  common  meaning  in  AI ,  viz., 
semantic  data  structures  such  as  Winograd’s  (BIBGEN  1972)  where 
physical  objects  are  represented  devoid  of  their  geometric 
coordinates  in  terms  of  labelled  items  and  their  relations  in 
list  structures  for  the  purpose  of  manipulation  and  inference. 
This  too  is  a  very  important  and  closely  related  area  whose 
connections  with  scene  analysis  will  be  discussed  though  not 
thoroughly  surveyed.  It  is  fraught  with  many  of  the  open-ended 
problems  of  the  general  representation  of  knowledge  in  AI 
independent  of  vision.  We  then  wedge  ourselves  narrowly  between 
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applied  mathematics  (physics ,  geometry  and  computer  processing 
of  pictures)  and  semantics  (knowledge,  meaning),  using  both 
for  support.  I  hope  to  show  that  many  important  advances  in 
scene  analysis  implicitly  embody  the  curious  semantics  of 
projective  geometry,  but  often  expressed  in  forms  specifically 
suited  to  optics  and  digital  computation  rather  than  the 
general  axiomatic  form  used  in  mathematics. 

This  survey  consists  of  four  chapters  and  a  bibliography. 
Chalter  1,  which  you  are  now  reading,  is  a  short  introduction 
to  scene  analysis  and  guide  to  the  rest  of  the  paper.  Chapter  2 
is  the  most  important  one.  It  consists  of  a  history  and  state 
of  the  art  description  of  scene  analysis  techniques.  It  is 
organized  according  to  type  of  approach,  in  roughly  historical 
order.  Chapter  3  is  a  conclusion  which  contains  a  critical 
overview  of  the  techniques  and  a  preview  of  methods  which  might 
be  successfully  applied  in  the  future.  Chapter  4  is  a  guide  to 
the  bibliography.  This  includes  a  description  of  the  kinds  of 
sources  available  and  their  relation  to  rese  ireh  institutions. 
Chapter  4  should  be  skimmed  immediately  in  o::der  to  understand 
the  format  of  references  to  the  bibliography  in  the  text  of 
this  survey. 


IV.  1-8 


'  / 


CHAPTER  2.  SCENE  ANALYSIS  METHODS 


2.1  Introductory  Remarks 

An  important  goal  of  scene  analysis  reseaz:ch  is  to  build 
artificial  eyes  connected  with  control  systems  which  enable  a 
robot  to  manipulate  and/or  maneuver  in  its  environment. 
Applications  include  the  design  of  visual  systems  for  industrial 
robots,  automatic  pilots  for  vehicles,  mechanical  extraterres¬ 
trial  explorers,  and  other  devices.  To  organize  the  discussion 
of  diverse  approaches,  scene  analysis  methods  are  subjectively 
divided  into  several  areas  in  this  chapter.  Two  major  areas 
are  line  analysis  and  region  analysis.  The  former  is  based  on 
classifying  and  relating  to  each  other  the  2-D  images  of  3-D 
edges  and  vertices  of  polyhedra  in  a  scene.  The  latter  is 
based  on  merging  adjacent  picture  regions  with  similar 
properties.  These  and  other  areas  will  be  described  and 
compared.  The  order  of  discussion  will  be  historical  except 
when  that  conflicts  with  organization  by  area. 

Many  of  the  techniques  discussed  appear  superficially  to 
differ  greatly  from  each  other.  A  deeper  analysis  in  terms 
of  projective  geometry,  however,  often  points  out  implicit 
exploitation  of  similar  phenomena.  This  underlying  connection 
is  not  often  discussed  in  the  literature  but  will  be  brought 
out  to  unify  description  of  techniques  in  this  chapter  and 
suggest  fruitful  areas  for  new  research  in  the  next. 

Scene  analysis  owes  much  to  earlier  work  in  picture 
processing  and  pattern  recognition.  These  were  the  first 
areas  in  which  methods  were  developed  for  representing,  analyz¬ 
ing,  and  manipulating  pictorial  (i.e.  2-D)  information  by 
computer.  This  essentially  geometric  information  processing 
is  quite  different  from  more  traditional  number  crunching  or 
text  manipulation  computer  applications.  Among  the  differences 
are  that  picture  manipulation  requires  much  more  processible 
memory,  and  the  processes  are  conceptually  two-dimensional 
rather  than  one-dimensional.  The  geometric  problems  and 
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techniques  of  pattern  recogniur  .  ind  scene  analysis  have  much 
in  common  but  also  some  crucial  differences.  In  the  psycho¬ 
logical  model  of  mechanical  vision,  the  aim  of  scene  analysis 
is  to  perceive  and  understand  2-D  images  of  3-D  scenes.  The 
meaning  of  this  analogy  can  be  clarified  using  a  rudimentary 
informational  model;  this  yields  a  natural  hierarchy  from 
physical  measurement  through  pattern  recognition  to  scene 
analysis.  The  nature  of  this  hierarchy  is  examined  in  the 
following  discussion  of  picture  processing  and  pattern  recogni¬ 
tion.  The  reader  can  find  comprehensive  bibliographies  of 
picture  processing  accompanied  by  clear  descriptions  of  tech¬ 
niques  in  Rosenf eld's  excellent  surveys  (BIBPIC  1969,1972-1975). 
Any  of  the  titles  containing  the  words  "Pattern  Recognition" 
in  BIBPIC  can  introduce  the  reader  to  that  topic. 


2.2  Pattern  Recognition  and  Scene  Analysis 

A  physical  measuring  instrument  can  be  considered  to  be  a 
finite  state  device,  each  of  whose  states  corresponds  to  a 
range  of  physical  conditions  of  the  object  it  is  measuring. 
Rothstein  (BIBGEN  1956)  has  shown  how  the  amount  of  information 
about  the  physical  object  derived  from  such  a  measurement  can 
be  formally  defined  in  terms  of  thermodynamic  entropy,  thereby 
linking  computational  and  physical  concepts.  Though  the 
measuring  device  is  finite  state,  and  information  consists  of 
reduction  of  uncertainty  about  which  of  these  states  correctly 
describes  the  measured  object,  the  physical  state  space  is 
usually  thought  of  as  a  one-dimensional  continuum  with  a  metric. 
That  is,  the  finite  set  of  states  is  well-ordered  and  can  be 
divided  into  subsets  with  that  same  property  without  limit 
(until  the  quantum  uncertainty  limit)  and  differences  between 
values  of  state  variables  have  meaningful  (comparable)  magni¬ 
tudes.  The  psychological  analogue  of  such  information  could 
be  termed  a  sensation  such  as  the  sensation  of  temperature, 
brightness,  or  pressure;  these  quantities  are  usually  considered 
as  belonging  to  one-dimensional  intensity  continue.  The  problem 
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of  classifying  such  information  is  usually  trivially  solved  by 
dividing  the  continuum  into  ranges  and  ordering  observations 
relative  to  the  limits  of  these  ranges. 

Pattern  recognition  involves  the  classification  of  patterns 
consisting  of  the  cartesian  products  of  many  (usually)  measure¬ 
ment  s  into  a  small  (usually)  number  of  sets.  The  large  dimen¬ 
sionality  of  measurements  not  only  obliterates  the  simple  ordering 
relation  described  in  the  preceding  paragraph  but  also  yields 
astronomical  numbers  of  distinct  possible  patterns.  A  common 
approach  to  simplifying  classification  is  to  pre-process  the 
information  by  extracting  features,  abstractions  of  simple  prop¬ 
erties  defined  by  relations  between  members  of  subsets  of  the 
pattern  space.  The  number  of  features  is  usually  far  smaller  than 
the  number  of  measurements  and  they  are  chosen  so  that  they  meas¬ 
ure  some  pattern  properties  related  to  the  desired  classification. 
Ti.a  cartesian  product  of  a  number  of  features  constitutes  feature 
space;  points  in  this  space  derived  from  patterns  which  are  to  be 
assigned  to  the  same  set  in  pattern  classification  ought  to 
cluster  together  more  than  points  derived  from  differing  patterns. 
By  defining  a  distance  function  between  points  in  feature  space, 
classificatron  is  achieved  by  assigning  a  new  point  to  the  near¬ 
est  cluster.  The  kinds  of  features  chosen  are  crucial  to  effective 
clustering  and  highly  task  specific.  Many  of  the  tools  of  pattern 
recognition  are  derived  from  sophisticated  statistical  decision 
theory  applied  to  the  assignment  of  points  in  feature  space  to 
clusters.  Tou  and  Gonzalez  (BIBPIC  1974)  contains  detailed 
description  of  these  tools  and  a  good  bibliography  for  other 
sources  in  pattern  recognition.  In  our  psychological  analogy, 
if  measurement  is  associated  with  sensation,  pattern  recognition 
is  associated  with  perception.  Assigning  large  numbers  of  dis¬ 
tinct  patterns  to  a  single  class  corresponds  to  recognizing  a 
form  or  percept  in  any  one  of  a  large  number  of  differing  parti¬ 
cular  presentations. 

In  visual  (also  called  optical  or  pictorial)  pattern  recogni¬ 
tion,  the  patterns  to  be  classified  are  usually  gray-level  (also 
called  grey-scale)  pictures.  These  are  2-D  pictures  which  have 


IV. l-ll 


been  divided  into  cell*  by  a  regular  grid.  Each  cell  ia  charac¬ 
terized  by  a  single  number  (measurement)  corresponding  to  the 
integrated  intensity  of  light  throughout  that  cell.  Any  picture 
is  then  a  pattern  or  point  in  a  space  with  as  many  dimensions  as 

there  are  cells  in  the  grid.  Hence,  for  an  n  *  n  grid,  the 

2 

pattern  space  has  dimension  n  .  The  earliest  efforts  in  pictor¬ 
ial  pattern  recognition  were  directed  to  the  problem  of  recogniz¬ 
ing  printed  characters  such  as  numerals  or  letters  of  the 
alphabet  by  template  matching,  described  as  follows.  The  gray- 
level  in  each  cell  is  either  1  or  0  depending  on  whether  ink 
(darkness)  is  present  or  absent  in  that  part  of  the  picture.  The 

ideal  example  of  any  character  is  called  a  template  and  is 

2 

represented  by  the  n  component  vector  of  l's  and  0's.  Classifi¬ 
cation  of  other  samples  is  achieved  by  identifying  each  with  the 
ideal  (template)  vector  it  best  matches;  by  "best”  is  meant  that 
for  which  the  largest  number  of  corresponding  components  match  in 
the  two  vectors.  The  method  fails  completely  if  the  sample  to  be 
recognized  is  not  almost  identical  in  size,  position  and  orienta¬ 
tion  to  the  template.  Two-dimensional  transformations  "scramble" 

2 

the  n  -dimensional  vector.  This  limitation  can  be  overcome  by 

2 

exploiting  the  fact  that  the  n  vector  can  be  "unfolded"  into  the 
two-dimensional  picture  space.  In  this  much  more  tractable  space, 
applying  simple  geometric  transformations  can  invert  the  effects 
of  2-D  translation,  rotation,  scale  change  and  other  one-to-one 
transformations  to  yield  a  pattern  in  registration  with  any 
template.  Such  techniques  have  been  applied  to  yield  optical 
character  recognition  (OCR)  devices  capable  of  reading  printed 
text  into  a  computer  automatically. 

When  geometric  transformations  are  unpredictable,  not 
invertible  or  too  complex,  for  example  in  recognizing  handwritten 
characters,  the  methods  described  in  the  preceding  paragraph  are 
inadequate.  In  more  complex  problem  domains,  feature  extraction 
can  be  very  helpful.  Unlike  the  case  of  general  pattern  recog¬ 
nition  where  feature  extraction  operates  over  the  many  dimensional 
space  of  the  cartesian  product  of  measurements,  in  pictorial 


pattern  recognition,,  feature  extraction  usually  operates  over  the 
two-dimensional  picture  space.  That  is,  features  in  the  latter 
case  often  represent  simple  geometric  properties.  Some  corres¬ 
pond  to  local  relations  such  as  shape,  orientation,  curvature, 
contrast  and  number  of  lines  at  an  intersection  while  others 
reflect  more  global  relations  such  as  topological  connectivity, 
visual  texture,  repetitiveness,  size  or  alignment  of  parts. 

Spatial  frequency  analysis  including  hologram  analysis  can  be 
regarded  as  the  extraction  of  global  features  related  to  geometric 
symmetries.  Presence  of  such  features  corresponds  to  peaks  in 
the  frequency  domain.  Families  of  orthogonal  square  wave  functions 
such  as  Walsh,  Haar  and  Hadamard  (e.g.  Gerardin  and  Flament, 

BIBPIC  1969)  functions  are  computationally  much  simpler  for 
spatial  analysis  thar  the  trigonometric  functions  used  in  Fourier 
analysis  partly  because  inner  products  in  the  former  case  can 
reduce  to  boolean  operations  and  also  because  their  digital  nature 
is  more  appropriate  for  grey-scale  (digitized)  pictures.  However, 
the  Fourier  transform  itself  can  be  cheaply  and  rapidly  effected 
using  optical  holography.  A  useful  characteristic  of  spatial 
frequency  analysis  is  that  geometric  distortions  and  their 
inverses  in  the  picture  domain  often  correspond  to  easily  expressed 
changes  in  the  transform  domain,  greatly  simplifying  the  problem 
of  picture  registration  in  generalizations  of  template  matching. 
Another  useful  characteristic  is  that  the  image  degrading  effects 
of  high  frequency  noise  resulting  from  digitization  of  poor  optics 
can  be  effectively  countered  by  low-pass  filtering  in  the  frequency 
domain.  Also,  spatial  integration  and  derivative  taking  are 
expressed  in  simple  algebraic  terms  in  the  transform  domain.  The 
disadvantages  of  spatial  frequency  analysis  are  computational 
expense  and  inflexibility  in  choice  of  features.  The  technique 
is  useful  only  if  certain  kinds  of  global  geometric  symmetries 
are  suitable  for  distinguishing  pattern  classes.  Then,  special 
purpose  hardware  can  overcome  the  computational  expense  of  general 
purpose  softeware.  Fingerprint  classification  and  visual  texture 
analysis  are  examples  of  tasks  where  this  is  true.  One  advantage 
in  extracting  features  of  whatever  type  is  that  pattern  classifica- 
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tion  can  often  be  accomplished  by  template  matching  in  the  2-D  trans- 

2 

form  apace  more  easily  than  in  the  n  dimensional  picture  space. 

Contrast  features  have  been  particularly  useful  in  a  wide 
variety  of  applciations,  including  scene  analysis.  These  are 
iocal  features  corresponding  to  small,  simple  neighborhoods 
exhibiting  strong  variation  in  the  gray  levels  of  their  consti¬ 
tuent  cells.  In  simple  pictures  composed  of  relatively  large 
uniformly  light  areas  differing  from  each  other  in  gray  level, 
such  high  contrast  neighborhoods  outline  the  regions.  The 
picture  can  then  be  represented  as  a  collection  of  chains  of  high 
contrast  boundary  curves.  In  a  sense  this  corresponds  to  storing 
only  the  locations  of  non-zero  gray-level  gradients  rather  than 
all  gray  levels.  Since  gradients  are  zero  in  large  uniform 
regions,  far  less  information  is  needed  to  specify  the  picture 
than  the  gray-level  representation,  though  the  latter  can  be 
reconstructed  by  a  computation  analogous  to  integration.  This 
economy  is  vital  because  the  gray-level  representation  often 
requires  more  central  computer  memory  than  is  available  at  one 
time. 

Representing  any  curve,  including  a  boundary,  by  encoding 
differences  between  coordinates  of  successive  cells  along  the 
curve  rather  than  listing  locations  is  called  chain  encoding. 

This  difference  in  coordinates  corresponds  roughly  to  the 
digitized  derivative  or  slope  and  economizes  storage  for  reasons 
analogous  to  those  given  in  the  preceding  paragraph  describing 
the  gray-level  to  gradient  transition.  In  addition,  however, 
chain  encoding  yields  digitized  descriptions  of  geometric  shape, 
independent  of  position,  which  can  be  used  to  compute  such 
measures  as  area,  curvature  and  perimeter  as  veil  as  express 
the  changes  in  patterns  resulting  from  geometric  transformations. 
Such  digitial  computations  have  been  developed  by  Freeman  (BIBPIC 
1974) ,  Weiman  and  Rothstein  (BIBTR  1972) ,  and  Rothstein  and 
Weiman  (BIBPIC  1976) .  Applications  range  from  image  processing 
to  computer  graphics.  The  discrete  expression  of  these 
geometric  computations  is  often  completely  different  from  their 
expression  in  continuous  mathematics .  The  processes  of  geometric 
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abstraction  outlined  above  play  an  important  role  in  pattern 
recognition  in  part  because  of  the  resulting  computational 
economies.  One  such  economy  is  realized  in  approximating  curves 
by  straight  line  segments.  Since  the  curve  is  completely  deter¬ 
mined  by  its  endpoints,  only  they  need  be  stored.  In  scene 
analysis,  straight  lines  also  play  an  important  role  because 
of  the  projective  geometric  relation  between  the  3-D  scene  and 
its  2-D  image. 

Dividing  a  picture  into  regions  using  any  criteria,  including 
high-contrast  boundaries,  leads  to  an  abstract  description  in 
terms  of  region  adjacency.  That  is,  each  region  can  be  represent¬ 
ed  as  a  node  in  a  graph  and  edges  connect  nodes  representing 
regions  sharing  a  common  bordor.  Besides  being  informationally 
economical,  the  picture  is  described  in  terms  of  topological 
relations  rather  than  the  exact  positions  and  shapes  of  its  parts. 
Such  a  description  is  invariant  under  a  large  group  of  geometric 
and  other  transformations.  This  approach  has  been  generalized  to 
what  are  called  linguistic  methods  in  pattern  recognition  (see 
Miller  and  Shaw,  BIBPIC  1968  and  Kaneff,  BIBPIC  1970).  In  the 
linguistic  approach  the  relational  description  of  a  picture  is 
more  important  than  simple  classification.  The  relations  may  be 
much  more  general  than  topological  connectivity  but  the  descrip¬ 
tion  is  still  graphical.  The  graph  can  be  modified  by  rules 
from  a  graph  grammar  to  either  parse  or  generate  "legal"  graphs, 
i.e.  those  corresponding  to  valid  pictures.  This  is  a  "more 
intelligent"  process  than  classification  of  pictures  in  the  same 
sense  that  parsing  a  sentence  (word)  in  a  formal  language  is  a 
"more  intelligent"  process  than  simply  recognizing  its  constitu¬ 
ent  words  (symbols) .  It  has  the  important  advantage  over  classi¬ 
fication  that  a  potentially  infinite  class  of  pictures,  i.e. 
including  pictures  never  seen  before,  can  be  recognized  or  des¬ 
cribed.  This  kind  of  relational  analysis  is  an  important 
charactertistic  of  many  scene  analysis  methods.  In  our  psycho¬ 
logical  model,  it  corresponds  to  comprehension  of  a  whole  as 
more  than  the  sum  of  its  perceived  or  recognized  constituent  parts. 


The  scans  is  comprahsndsd  ss  sn  organization ,  rather  than  merely 
a  collection,  of  objects. 

There  are  phenomena  in  the  process  of  projecting  3-D  scenes 
into  their  2-d  images  which  traditional  2-D  pattern  recognition 
techniques  do  not  address.  Paramount  among  these  is  foreshorten¬ 
ing,  a  distortion  in  which  the  2-D  distance  between  image  points 
depends  on  the  orientation  of  the  line  connecting  the  corres¬ 
ponding  3-D  points  relative  to  the  line  of  sight  and  also  on  their 
distance  from  the  image  plane.  The  extreme  case  of  foreshortening 
occurs  when  the  line  connecting  two  3-D  points  is  a  line  of  sight 
(goes  through  the  point  of  projection) .  For  opaque  objects,  the 
usual  case,  this  yields  occlusion,  a  discontinuity  that  abruptly 
erases  part  of  an  object's  image  that  may  contain  important 
features  for  recognition.  This  phenomenon  causes  hidden  surfaces 
and  on  3-D  rotation  of  an  object  in  a  scene  yields  a  succession 
of  2-D  views  that  cannot  be  continuously  transformed  into  each 
other.  Projection  is  a  many-to-one  function  and  therefore  not 
invertible.  This  loss  of  information  and  the  discontinuities 
in  2-D  images  of  continuously  transformed  3-D  objects  cannot  be 
overcome  by  ordinary  feature  extraction,  2-D  geometric  transfor¬ 
mations,  or  template  matching. 

The  information  loss  in  projection  is  structured  in  the 
sense  that  any  two  3-D  points  that  are  mapped  into  one  image 
point  lie  on  a  straight  line  through  the  point  of  projection, 
i.e.  a  line  of  sight.  If  there  are  two  points  of  projection 
then,  the  intersection  of  two  lines  of  sight  uniquely  determines 
the  position  of  a  point  in  three  dimensions,  a  property  useful 
in  binocular  stereoscopic  vision.  Geometrically  related  to  this 
property  is  the  fact  that  a  line  in  three  dimensional  space 
together  with  the  point  of  projection  determine  a  plane  in 
three  space.  This  plane  intersects  the  image  plane  in  a  straight 
line;  hence  3-D  straight  lines  have  2-D  images  in  the  picture 
plane  that  are  also  straight  lines.  This  fact,  together  with 
the  informational  economy  in  representing  straight  lines 
discussed  in  the  paragraph  on  chain-encoding  earlier,  make  it 
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natural  that  many  of  -ha  earliest  efforts  of  scene  analysis  were 
directed  to  analysing  scenes*  containing  straight  lines .  Such 
scenes  contained  polyhedra,  3-D  objects  bounded  by  planar  faces 
which  therefore  intersect  along  straight  line  edges.  The  2-D 
images  of  the  faces  are  regions  bounded  by  straight  lines.  In 
simple  lighting  situations  a  planar  face  is  usually  uniformly 
bright#  but  the  different  orientations  of  faces  relative  to  the 
light  source  and  the  observer  yield  image  regions  of  different 
grey  levels.  Thus#  the  images  of  edges  may  be  found  by  using 
contrast  features.  We  shall  briefly  examine  some  of  the  techniques 
used  for  finding  these  straight  lines  in  gray-level  pictures 
before  discussing  th<*  analysis  of  polyhedral  scenes. 

Identifying  straight  lines  in  gray- level  pictures  was  a 
much  more  difficult  task  than  originally  anticipated  by  researchers 
in  scene  analysis.  It  appeared  that  one  need  only  trace  along 
adjacent  high-contrast  features  in  pictures  of  polyhedra  to  get 
straight  lines#  but  in  reality  such  features  were  often  spuriously 
present  in  noisy  picture  regions  not  near  straight  lines  and 
difficult  tc  detect  in  some  places  along  faint  straight  lines. 
Though  smoothing  (or  low  pass  filtering)  reduces  noise,  it  also 
suppresses  contrast.  Special  contrast  feature  extractors  can  be 
designed,  however,  which  paradoxically  enhance  contrast  while 
suppressing  noise;  they  combine  spatial  smoothing  and  the  taking 
of  spatial  derivatives.  Smoothing  a  function  can  be  accomplished 
by  taking  ihe  convolution  of  the  function  with  a  fixed  smoothing 
function  such  as  the  gaussian  function.  In  Lhe  discrete  veision 
of  the  process#  the  convolution  integral  reduces  to  a  weighted 
average  of  values  in  a  local  neighborhood.  The  result  in  either 
version  is  "blurring"  or  smoothing.  Now,  regions  of  high  contrast 
are  characterized  by  extreme  values  of  derivatives.  In  a  picture, 
derivatives  are  defined  as  the  limits  of  ratios  of  intensity  to 
distance  between  points  at  which  the  intensities  are  observed. 
s - 

From  now  on,  the  word  "scene"  refers  to  the  3-D  configuration 

whose  image  is  projected  onto  a  2-D  picture. 
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In  the  discrete  version  the  limit  cannot  proceed  below  grid  cell 
size,  which  if  taken  to  equal  unity  reduces  the  derivative  to  a 
kind  of  weighted  finite  difference.  Smoothing  and  derivative 
taking  in  the  continuous  case  can  be  conceptually  unified  by 
considering  their  representations  in  Laplace  transform  theory. 
There,  convolution  in  transform  space  corresponds  to  multipli¬ 
cation  of  the  pattern  transform  by  the  transform  of  the  smoothing 
function.  The  transform  of  the  derivative  of  a  pattern  is  simply 
a  constant  times  the  transform  of  the  pattern.  Since  products 
commute,  the  same  result  is  effected  by  taking  the  derivative  of 
a  smoothed  pattern  as  by  taking  the  convolutio  of  the  pattern 
with  the  derivative  of  the  smoothing  function.  In  discrete  form, 
the  latter  coresponds  to  taking  a  weighted  average  of  the 
picture  points  in  a  neighborhood  where  the  weights  may  be  negative 
as  well  as  positive.  Thus,  smoothing  and  derivative  taking  can 
be  accomplished  in  one  step  no  more  complex  computationally  than 
smoothing.  Smoothing  is  analogous  to  a  Oth  order  difference,  and 
various  higher  order  derivatives  are  analogous  to  higher  order 
differences.  In  the  2-D  case  partial  derivatives  correspond  to 
finite  differences  or  weighting  functions  over  neighborhoods 
rather  than  intervals.  The  choice  of  diameters  of  these  neigh¬ 
borhoods  is  critical  in  effective  feature  extraction.  Ordinarily 
"noise"  grain  is  much  finer  than  "signal"  (or  true  picture)  grain. 
If  the  diameter  of  averaging  neighbhoods  falls  between  these  two 
grain  sizes,  both  noise  suppression  and  contrast  enhancement  can 
occur  simultaneously  resulting  in  a  better  signal-to-noise  ratio. 

Almost  all  contrast  feature  detectors  can  be  considered  as 
weighted  averagers  as  described  in  the  preceding  paragraph. 

Examples  are  Shirai 1  s  contrast  detector  (BIBSA  1973)  whj  <~h 
resembles  a  smoothing  and  first  order  derivative  taken  in  a 
direction  perpendicular  to  a  line.  The  Laplacian  operator  is  a 
second  order  partial  derivative;  Horn  (BIBTR  1972  and  BIBPIC  1974) 
discusses  its  discrete  analog.  Hueckel's  (EIBPIC  1971,  1973) 
operators  are  combinations  of  finite  differences  of  various  orders; 
tor  that  reason  they  are  particularly  versatile  in  detecting  and 
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describing  a  broad  range  of  types  of  contrast;  however  they  are  some¬ 
what  more  expensive  computationally  than  simpler  differencing  schemes. 

Smoothing  and  taking  spatial  derivatives  of  a  picture  of  a 
polyhedral  scone  yields  distinctive  ridges  and  valleys  along  the 
images  of  polyhedral  edges  and  yields  flat  regions  where  high 
frequency  noise  or  uniform  intensity  prevail.  The  TRACK  program 
package  developed  at  MIT  and  described  by  Lerman  and  Woodham 
(BXBTR  1973)  reduces  such  a  picture  to  a  line  drawing.  It  uses  a 
Shirai  type  contrast  detector  to  locate  points  of  high  contrast; 
with  these  it  associates  more  distant  points  lying  close  to  the 
best  straight  line  fitting  the  original  points ,  progressing  from 
point  to  point.  A  straight  line  is  thus  extended  in  the  direction 
best  fitting  (according  to  some  mean  square  error  criterion)  those 
points  already  assigned  to  it.  When  no  more  feature  points  are 
found  in  that  direction,  the  line  is  terminated.  Thresholds  can 
be  set  to  "tune"  the  program  to  various  contrast  levels  and 
acceptable  line  lengths.  In  the  end,  the  data  structure  that 
TRACK  presents  to  scene  analysis  programs  consists  of  lines  speci¬ 
fied  by  their  endpoints.  Neither  gray-levels  nor  features  remain. 

Using  statistical  mathematical  models  for  noise,  edge  image 
blurring,  and  light  intensity  variations  over  polyhedral  surfaces 
Griffith  (BIBPIC  1971, 1973)  derived  theoretically  the  acceptance 
criteria  for  straight  line  determination  that  are  found  experi¬ 
mentally  (by  "tuning"  thresholds)  in  the  TRACK  program  package. 

Duda  and  Bart  (BIBPIC  1972)  use  a  rather  different  method  for 
recognizing  straight  lines  in  noisy  situations  which  was  applied 
by  O'Gorman  and  Clowes  (BIBPIC  1973).  It  involves  first  finding 
high-contrast  neighborhoods  and  then  scanning  these  in  two  perpen¬ 
dicular  directions  with  feature  extractors  resembling  first  order 
derivatives.  These  two  "directional  derivatives"  are  components 
of  the  local  light  intensity  gradient.  The  direction  of  any  local 
edge  is  perpendicular  to  this  gradient,  and  can  be  considered  as 
a  small  segment  of  a  straight  line.  The  so-called  Hough  Transform 
is  applied  to  each  small  line  segment  (local  edge)  to  represent  it 
as  a  point  in  a  space  whose  two  dimensions  correspond  to  the  angle 
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the  local  edge  made  with  the  x-axis  in  the  picture  apace  and  the 
diatance  of  the  line  on  which  it  might  lie  from  the  origin.  Any 
extended  line  in  picture  apace  will  yield  a  email  denae  cluater 
of  auch  points  in  Hough  Transform  apace.  The  degree  of  cluster¬ 
ing  is  therefore  used  as  a  criterion  in  deciding  whether  a 
line  exists  in  the  picture;  its  endpoints  can  be  found  by  locating 
extreme  values  of  the  coordinates  of  the  neighborhoods  corres¬ 
ponding  to  the  edges  in  question.  A  crucial  difference  between 
this  method  and  tracking  methods  is  that  orientation  of  a  feature 
and  its  existence  anywhere  on  the  line  locus  are  used  as  evidence 
rather  than  feature  continuity  with  respect  to  tracking  order. 

The  result  is  that  a  line  can  be  recognised  even  if  interrupted 
by  many  large  gaps  or  crossed  by  other  curves;  in  such  situations 
tracking  could  fail.  The  end  result  of  the  Duda-Hart  process  is 
also  a  data  structure  representing  straight  line  segments  whose 
end  points  are  specified. 

If,  in  our  psychological  analogy,  light  intensity  measurement 
corresponds  to  sensation,  then  pattern  recognition,  including 
straight  lire  recognition,  could  be  called  perception.  The  next 
level  up  in  the  hierarchy  is  then  comprehension.  In  artificial 
vision  we  shall  here  consider  comprehension  as  understanding 
something  about  the  3-D  relations  between  parts  whose  2-D  images 
have  been  perceived.  The  goal  of  scene  analysis  is  that  kind  of 
comprehension.  We  will  now  examine  some  of  the  first  approaches 
to  recognizing  scenes  consisting  of  polyhedra  based  on  the  line 
drawings  of  their  images. 


2.3  Approaches  to  Scene  Analysis 

2.3.1  Analysis  of  Straight  Line  Representations  of  Polyhedra 

In  the  history  of  scene  analysis  Roberts  (BIBSA  and  BIBTR 
1965)  is  credited  with  taking  the  first  step  in  computer  interpre¬ 
tation  of  a  2-D  picture  as  a  monocular  view  of  a  3-D  scene. 

His  method  was  an  extension  of  known  picture  processing  and 


pattern  recognition  techniques.  First,  the  grey-scale  picture  is 
reduced  to  a  line  drawing  by  fitting  lines  to  contrast  features. 
Recognition  is  accomplished  by  geometric  transformation  and 
generalized  template  matching,  but  in  the  3-0  realm  rather  than 
the  2-0  picture  space.  Picture  regions  are  considered  to  be  the 
images  of  polygonal  faces  of  polyhedra.  The  polyhedra  in  the 
3-0  scene  are  restricted  to  be  examples  of  certain  known  models, 
hence  the  2-D  images  of  faces  can  only  be  of  certain  known  types, 
subjected  to  projective  distortions.  The  final  image  is  a  product 
of  rigid  motions  of  the  polyhedra  in  three  dimensions  and  the 
projective  transformation.  In  cartesian  coordinates  the  former 
are  linear  and  the  latter  nonlinear  transformations.  Roberts 
overcame  the  computational  difficulties  in  expressing  this  mixture 
by  using  homogeneous  coordinates  which  permit  a  unified  linear 
representation  of  all  relevant  transformations.  This  notation 
originated  in  projective  geometry  (see  Coxeter  BIBGEN  1964)  where 
it  greatly  simplified  and  unified  description  of  phenomena  diffi¬ 
cult  or  impossible  to  describe  in  Euclidean  geometry.  The  next 
step  in  Roberts'  recognition  process  is  to  invert  the  transforma¬ 
tions  which  yielded  the  image  to  identify  the  visible  faces  of 
the  3-D  polyhedra.  Recognition  consists  in  matching  the  resulting 
3-D  structure  with  a  known  model,  a  generalization  of  template 
matching . 

Roberts'  method  cannot  correctly  label  picture  regions  as 
faces  of  blocks  in  a  3-D  scene  when  that  scene  contains  unknown 
types  of  polyhedra  or  known  types  which  partially  occlude  each 
other.  Guzman  (BIBTR  and  BIBSA  1968)  tried  to  overcome  these 
weaknesses  as  well  as  avoid  the  computationally  expensive  numeri¬ 
cal  approach  by  using  more  linguistic,  relational  methods.  Given 

the  line  drawing  of  a  scene,  Guzman's  SEE  program  does  not  use 

the  exact  coordinates  of  points  and  lines  as  does  Roberts'  program 

but  rather  the  geometric  relations  between  them.  SEE  concentrates 
on  points  at  which  several  picture  lines  intersect.  These  junctions 
are  the  images  of  polyhedral  vertices,  the  meeting  points  of 
several  faces.  The  acuteness  or  obtuseness  of  the  angles  between 


these  junction  lines  depends  on  the  viewer's  position  relative  to 
the  planes  of  the  polyhedron.  For  example ,  when  looking  at  a 
corner  of  a  cubical  building  from  the  street,  the  two  visible 
roof  edges  and  the  intersection  of  the  walls  at  the  corner  form 
an  image  junction  which  resembles  an  upward  pointing  arrow.  After 
a  vertical  helicopter  ride  to  a  position  above  the  plane  of  the 
roof,  the  junction  resembles  a  two  pronged  fork;  the  angle  between 
the  shaft  of  the  arrow  and  its  barbs  has  changed  from  acute  to 
obtuse  and  the  roof  surface  has  become  visible.  These  and 
several  other  junction  configurations  are  recognized  by  Guzman's 
program.  From  observing  cases,  heuristics  are  derived  for  infer¬ 
ring  whether  or  not  regions  on  either  side  of  a  junction  line 
should  be  considered  as  images  of  adjacent  faces  of  a  polyhedron. 
An  abstract  model  of  the  scene  is  constructed  by  representing  each 
region  as  a  node  in  a  graph  and  face  adjacency  is  denoted  by 
linking  appropriate  nodes.  The  phase  just  descrioed  yields  a 
representation  based  on  information  local  to  junctions.  A  more 
globally  plausible  structure  is  derived  in  the  next  phase  by 
examining  the  graph  and  ruling  out  certain  unlikely  configurations 
such  as  incorrect  connections  between  distinct  polyhedra. 

The  graphical  representation  of  distinct  bodies  by  Guzman 
has  a  linguistic  flavor  and  is  a  higher  level  abstraction  than 
Roberts'  representation  of  examples  of  models.  This  permits 
analysis  of  much  more  complex  scenes  with  many  objects  partially 
occluding  each  other  and  containing  objects  never  seen  before. 

One  key  to  the  power  of  this  approach  is  that  junction  properties 
exploited  by  the  region-linking  heuristics  are  invariant  under  a 
large  class  of  2-D  and  projective  transformations.  Therefore, 
transformation  inversion  which  Roberts  needed,  is  never  necessary 
Junctions  are  informationally  rich  because  they  capture  at  a 
single  point  the  relations  between  several  extended  parts  (the 
faces)  of  polyhedra. 

Guzman's  program  was  a  first  step  in  junction  analysis,  and 
it  had  several  unforeseen  weaknesses,  ^fter  a  more  careful  analy¬ 
sis  of  scenes  and  their  images,  Rattner  (BIBTR  1970)  embodied  an 
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improved  set  of  heuristics  in  his  program  SEEMORE.  One  of  the 
weaknesses  of  Guzman's  program  was  that  in  scenes  with  many 
objects  close  to  each  other,  shadows  are  thrown  across  faces  and 
their  edges  can  be  misinterpreted  as  the  images  of  polyhedral 
edges.  Orban  (BIBTR  1970)  studied  these  configurations  and 
developed  heuristics  for  detecting  and  removing  such  lines  from 
junctions . 

Waltz  (BIBTR  1972)  attacked  the  same  scene  analysis  domain 
as  Guzman,  Orban,  and  Rattner  but  instead  of  using  a  set  of 
heuristics  based  on  empirical  observations  he  analyzed  geometric 
relations  between  3-D  vertices  and  the  junctions  that  are  their 
projective  images.  His  fundamental  object  of  analysis  was  also 
the  junction  type,  but  instead  of  considering  only  shape,  he 
examined  the  3-D  configurations  that  could  give  rise  to  such 
shapes.  He  then  labelled  junction  lines  with  descriptors  of  the 
3-D  properties  they  could  correspond  to.  These  labels  were 
originated  independently  by  both  Huffman  (BIBSA  1971)  and  Clowes 
(BIBSA  1971)  to  distinguish  between  3-D  edges  which  are  concave, 
convex,  occluding  bordering  surfaces  or  cracks  between  objects. 
Huffman  used  these  four  labels  to  determine  whether  or  not  line 
drawings  could  be  interpreted  as  pictures  of  "real"  objects. 

This  involves  attaching  labels  to  lines  and  progressing  to  other 
lines  through  junctions  until  all  lines  are  labelled.  The 
existence  of  one  kind  of  label  at  a  junction  constrains  the 
label  of  other  lines.  Any  line  can  only  be  assigned  a  single 
label.  If  it  is  impossible  to  label  a  picture  consistent  with 
these  constraints,  it  could  not  be  the  picture  of  a  real  object. 
An  example  is  the  "devil's  pitchfork",  a  picture  which  locally 
looks  "real"  but  is  inconsistent  globally.  Waltz  exhaustively 
examined  all  possible  views  of  vertices  where  three  (and  in  a 
few  cases,  more)  polyhedral  faces  met  and  labelled  the  lines 
accordingly.  For  any  particular  junction  shape,  a  large  number 
of  possible  labellings ,  each  corresponding  to  a  distinct  3-D 
configuration,  is  possible. 

Waltz'  program  associates  with  each  junction  in  a  picture 
a  list  of  all  its  possible  labellings.  Those  are  ruled  out  for 
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which  no  adjacent  junction  haa  a  labelling  compatible  along  the 
line  they  share.  Waltz  originally  thought  a  tree  search  would  be 
necessary  to  examine  all  consistent  labellings,  with  the  accumula¬ 
tion  of  interacting  constraints  leading  to  a  single  or  few 
possible  complete  picture  labellings.  To  his  surprise,  most 
junction  labellings  were  ruled  out  in  simply  progressing  from 
junction  to  junction.  The  reason  was  that  the  apparent  increase 
in  descriptive  complexity  is  more  than  offset  by  the  fact  that 
the  number  of  geometrically  realizable  labellings  of  any  parti¬ 
cular  junction  shape  is  far  smaller  than  the  number  of  combinations 
possible  if  the  lines  could  be  independently  labelled.  Thus 
compatibility  between  adjacent  junctions  is  much  less  likely  than 
if  lines  were  labelled  independently.  Waltz  next  introduced 
shadow  labellings  for  lines;  exactly  the  same  phenomenon  occurred 
resulting  in  even  better  scene  analysis  with  no  tree  searching 
necessary.  Shadows,  instead  of  interfering  with  scene  analysis 
as  in  previous  approaches,  actually  contribute  information. 

Newborn  (BIBTR  1974)  embodied  Waltz's  picture  labelling  algo¬ 
rithms  in  New  York  University's  high-level  set  theoretic  language 
SETL. 

The  descriptive  labels  on  the  lines  of  an  analyzed  picture 
can  easily  be  used  to  derive  object  identity.  Waltz  goes  beyond 
Guzman,  however,  in  that  even  more  geometric  information  about 
the  3-D  scene  is  found.  Crude  shape  descriptors  for  edges  tell 
something  about  the  relative  positions  of  planes  in  the  3-D 
scene.  A  drawback  is  that  instead  of  a  small  number  of  junction 
labels,  a  large  dictionary,  with  thousands  of  entries,  must  be 
stored  or  computed. 

Waltz's  improvement  over  Guzman  is  a  result  of  generalizing 
Guzman's  heuristics  which  were  seen  as  special  cases  of  the 
geometric  semantics  relating  3-D  vertices  and  their  2-D  images 
(junctions) .  Mackworth's  program  POLY  (BIBSA  1973)  is  an 
extension  in  this  tradition.  His  approach  is  based  on  the 
following  geometric  representation.  Any  2-D  picture  of  a  3-D 
scene  consists  of  the  projection  of  the  latter's  visible  points 
through  the  image  plane  to  a  single  special  point  called  the 
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viewpoint.  In  a  camera,  for  example,  the  photographic  film 
occupies  the  image  plane  and  the  viewpoint  lies  in  the  lens. 

The  2-D  image  of  a  line  in  the  3-D  scone  is  also  a  line.  This 
image  line  and  the  viewpoint  determine  a  plane  which  must  also 
contain  the  line  in  the  3-D  scene  which  gave  rise  to  the  image. 
This  is  called  the  "plane  of  interpretation"  by  Mackworth;  the 
polygonal  image  of  a  polyhedral  scene  yields  a  bundle  of  planes 
of  interpretation  all  of  which  contain  the  viewpoint.  Now,  if 
the  scene  consists  of  polyhedral  objects,  their  faces  intersect 
along  straight  line  edges  whose  images  determine  the  planes  of 
interpretation.  The  relations  between  the  orientations  of  these 
sets  of  planes  c  n  be  exploited  to  infer  the  former  from  the 
latter.  The  cumbersome  scene  representation  of  planes  as  sets 
of  points  is  avoided  by  representing  each  plane  as  a  single  point 
in  dual  space.  In  brief,  any  plane  in  three  space  can  be 
characterized  by  the  direction  of  its  normal  and  its  distance 
fronrthe  origin;  thu3  in  dual  space  each  plane  is  represented 
by  a  point  at  the  tip  of  a  vector  starting  at  the  origin  pointing 
in  the  direction  of  the  normal  and  having  magnitude  equal  to  the 
distance  of  the  plane  from  the  origin.  Relations  between  planes 
have  the  following  representations  in  dual  space.  A  set  of 
parallel  planes  corresponds  to  a  set  of  points  lying  on  the  same 
line  through  the  origin.  The  intersection  of  a  number  of 
planes  in  a  single  point  (polyhedral  vertex)  corresponds  to  the 
vertices  of  a  planar  polygon  in  dual  space.  All  planes  of  inter¬ 
pretation  axe  represented  as  points  in  dual  space  lying  in  a 
single  plane;  that  plane  is  the  dual  of  the  viewpoint.  The  edges 
bounding  a  single  face  of  a  polyhedron  correspond  in  dual  space 
to  lines  passing  through  a  single  point,  the  dual  of  the  planar 
face. 

Mackworth 's  program  POLY  operates  on  the  dual  space  repre¬ 
sentation  of  scenes  and  their  images  to  determine  edge  types 
(such  as  Huffman  and  Clowes  types) ,  surface  orientation,  body 
identification,  and  hidden  structure.  It  tries  to  link  neigh¬ 
boring  image  regions  by  labelling  the  lines  separating  them  as 
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the  images  of  "connect"  edges  in  the  scene,  i.e.  intersections 
of  the  planes  whose  images  are  '  regions.  In  dual  space, 
identifying  a  connect  edge  between  two  planes  corresponds  to 
drawing  the  dual  of  the  edge  (a  line)  through  the  duals  of  the 
planes  (two  points) .  Progressing  from  region  to  region  in  the 
image  corresponds  to  connecting  points  with  straight  lines  in 
dual  space.  Constraints  imposed  by  incidences  in  the  image  have 
their  counterpart  constraints  in  dual  space,  preventing  many 
edges  from  being  interpreted  as  connect  edges.  Since  the  back¬ 
ground  ordinarily  consists  of  a  single  plane,  the  program  begins 
by  attempting  to  connect  it  to  all  regions  bordering  it.  There, 
the  large  number  of  simultaneous  constraints  greatly  reduces  the 
amount  of  backtracking  necessary  in  searching  for  correct  picture 
interpretations.  The  eventual  aim  is  a  picture  in  which  every 
edge  is  labelled  as  connecting  or  occluding  two  neighboring 
regions.  Polyhedral  objects  in  the  scene  correspond  to  poly- 
hedra  in  dual  space;  completing  missing  portions  in  the  latter 
corresponds  to  solving  hidden  line  and  surface  problems  in  the 
former.  Pictures  for  which  this  is  impossible  are  generaliza¬ 
tions  of  Huffman's  illegal  or  "nonsense"  sentences,  for  example, 
the  "devil's  pitchfork"  mentioned  in  the  discussion  of  Waltz' 
approach . 

Mackworth's  purely  geometric,  projective  approach  appears 
superficially  to  be  a  step  backward  from  the  abstract,  symbolic 
approaches  of  Guzman  and  Waltz  to  the  numerical  methods  of  Roberts. 
That  this  is  not  so  stems  from  Mackwroth's  dual  space  represen¬ 
tation  in  which  points  correspond  not  to  points  in  the  scene  but 
to  sets  of  points  (planes)  bearing  some  relation  to  each  other. 
Roberts'  approach  was  based  essentially  on  transformation  and 
template  matching,  though  with  a  novel  twist;  in  Mackworth's 
approach  object  identification  results  from  interpreting  the 
dual  graph  in  terms  of  connectivity  rather  than  shape  and  posi¬ 
tion  of  picture  parts.  This  extraction  of  high-level  relations 
is  in  the  spirit  of  Guzman  and  Waltz  and  is  one  property  which 
characterizes  artificial  intelligence  approaches  as  opposed  to 
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pattern  recognition  approaches.  In  addition,  however,  the  dual 
space  representation  can  yield  geometric  information  about  the 
scene  inaccessible  to  Guzman  and  Waltz.  Distances  and  angles  in 
dual  space  correspond  to  quantities  describing  geometric  rela¬ 
tions  between  entities  in  the  scene.  Among  these  are  orientations 
of  faces  and  edges;  Waltz's  classification  of  these  was  restrict¬ 
ed  to  binary  distinctions  such  as  convex  vs.  concave.  Waltz's 
dictionary  of  junction  types  is  a  list  of  special  cases  of 
phenomena  Mackworth  can  represent  in  their  entirety.  Waltz's 
surprise  at  the  small  number  of  realizable  parsings  of  sentences 
over  words  in  this  dictionary  would  have  vanished  if  he  had  known 
about  the  general  constraints  of  positioning  points  and  lines  in 
dual  space.  Mackworth' s  dual  space  representation  is  capable  of 
yielding  an  infinite  dictionary, including  all  the  difficult  cases 
of  polyhedral  vertices  formed  by  more  than  three  coincident  planes . 
Though  he  did  not  include  it,  there  is  room  for  representing 
shadows  as  projections  of  edges  through  the  light  source  onto  a 
surface  in  the  scene  and  then  through  the  viewpoint  to  the  image 
plane.  This  new  projection  point  (the  light  source)  ought  to  add 
many  of  the  scene  analysis  capabilities  of  binocular  visual  systems; 
the  dual  space  equivalent  of  this  point  is  a  plane  which  would 
further  constrain  relations  between  dual  space  points. 

The  polyhedral  scene  analysis  programs  just  described  (from 
Roberts  through  Mackworth)  are  characterized  as  "bottom  up" . 

That  is,  before  scene  analysis  proper  begins,  small  regions  of 
high  local  contrast  (features)  are  found  and  linked  together  by 
least  squares  or  similar  statistical  methods  into  straight  line 
segments.  These  line  segments  are  the  primitives  on  which  the 
scene  analysis  programs  operate.  A  serious  drawback  of  this 
approach  is  that  feature  detection  is  very  sensitive  to  threshold 
and  appropriate  thresholds  may  differ  in  different  parts  of  the 
picture  depending  on  the  high-level  (semantic,  3-D  geometric) 
structure  of  the  scene.  This  high-level  information  cannot  be 
known  in  advance  of  the  low-level  or  line  finding  phase  of  the 
operation.  In  using  a  uniform  threshold  throughout  the  picture, 
high  sensitivity  yields  spurious  feature  detection  where  noise 
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is  present.  Noise  is  inevitably  high  due  to  camera  imperfections, 
bad  lighting,  uneven  reflective  properties  of  objects  in  the 
scene  and  digitization  truncations  (HORN,  BIBTR  1969) .  Lowering 
sensitivity  of  feature  finders  to  avoid  false  positive  detections 
results  in  failure  to  detect  faint  lines  between  regions  which  are 
the  images  of  nearly  equally  bright  adjacent  faces  of  a  polyhedron. 

Falk's  (BIBSA  1971,1572  and  BIBTR  1970)  approach  to  solving 
the  problems  described  in  the  preceding  paragraph  was  to  append  a 
"top-down"  phase  to  the  bottom-up  phase.  Spurious  lines  are 
removed  and  missing  lines  are  added  after  the  initial  bottom-up 
phase  by  considering  the  initial  line  drawing  as  the  projected 
image  of  a  scene  consisting  of  a  collection  of  polyhedra  of  known 
types.  Inverting  the  scene-to-image  projection  in  the  manner  of 
Roberts  and  solving  hidden  line  and  surface  problems  leads  to  a 
best  model  of  the  scene  in  terms  of  the  known  polyhedral  types. 
Missing  or  extra  lines  resulting  from  the  bottom-up  phase  are 
added  or  deleted  respectively  to  make  the  final  line  drawing 
conform  to  the  model  of  the  scene.  Thus,  it  differs  from  Roberts' 
approach  in  that  a  Guzman-like  bottom-up  phase  first  predicts 
the  scene  objects  and  only  after  that  relatively  cheap  computation 
(list  processing  rather  than  matrix  operations)  does  the  projec¬ 
tive  analysis  occur.  The  advantage  is  not  only  greater 
simplicity,  but  the  Guzman-like  phase  of  the  operation  is  far 
superior  in  analyzing  scenes  with  large  numbers  of  objects  which 
occlude  portions  of  each  other.  One  drawback  of  this  method  is 
that  the  scene  is  limited  to  collections  of  objects  from  a  fixed 
set.  The  hidden  line  and  surface  algorithms  are  also  computation¬ 
ally  expensive. 

Shirai  (BIBTR  1972  and  BIBSA  1973)  overcame  many  of  the  draw¬ 
backs  of  the  bottom-up  approach  using  a  method  that  looks  decep¬ 
tively  low-level,  but  succeeds  as  a  result  of  efficient  use  of 
heuristics  based  on  high-level  properties  of  polyhedral  scenes. 

His  overall  strategy  ip  to  proceed  from  the  most  distinct  (high¬ 
est  contrast)  edges  to  the  least,  searching  for  lines  only  in 
places  suggested  by  earlier,  stronger  evidence.  Lines  are 
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classified  into  three  categories.  Those  in  the  first ,  contour 
lines,  are  characterized  at  the  low  (picture  processing)  level 
as  consisting  of  picture  points  whose  grey  levels  contrast  greatly 
with  local  neighbors.  At  the  high  (3-D  scene  property  level, 
contour  lines  correspond  to  the  separation  between,  for  example, 
a  light  object  seen  against  a  dark  background.  The  second 
category  consists  of  boundary  lines  which  are  characterized  at 
the  low  level  by  slightly  smaller  contrast  values;  at  the  high 
level  they  correspond  to  the  separation  between  an  object  and 
another  object,  or  between  an  object  and  the  background.  Contour 
lines  are  special  cases  of  boundary  lines.  The  third  category, 
internal  lines,  contains  those  exhibiting  least  contrast;  they 
correspond  to  lines  separating  adjacent  faces  of  a  body. 

Shirai • s  process  is  begun  by  sampling  a  grey  scale  picture 
once  in  every  8x8  cell  region.  Samples  whose  grey  levels  differ 
greatly  from  the  picture  average  are  singled  out  as  contour  points; 
full  resolution  is  restored  and  the  same  criterion  is  applied  to 
find  contour  points  missed  by  the  original  sampling  in  the  8x8 
neighborhoods  of  those  found  by  the  original  sampling.  The  chains 
of  contour  points  resulting  from  this  process  are  traced  and 
broken  into  straight  lines  whose  endpoints  are  local  maxima  of 
hain  curvature.  These  constitute  a  set  of  contour  lines,  strong 
candidates  for  delineating  the  outlines  of  objects  in  the  image 
of  a  scene  because  an  object  and  its  background  usually  differ 
more  from  each  other  in  optical  properties  and  position  relative 
to  the  light  source  than  adjacent  faces  of  the  same  object  or 
nearby  faces  of  neighboring  objects  of  a  similar  type.  This  is 
an  example  of  the  semantics  of  projective  geometry  relating  3-D 
objects  and  their  2-D  images  embodied  in  Shirai' s  heuristics. 

Most  of  the  other  nine  heuristics  involve  attempts  to  extend  lines 
from  their  endpoints  or  find  lines  going  in  new  directions 
starting  at  the  endpoints  of  other  lines.  The  semantic  justi¬ 
fication  for  these  heuristics  is  obvious  from  looking  at  the 
junction  types  of  Guzman  or  Waltz.  That  is,  lines  in  a  picture 
are  images  of  polyhedral  edges.  Edges  terminate  in  vertices  with 
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other  edges.  Therefore  their  images  are  junctions  of  lines ,  the 
logical  place  to  look  for  extensions  of  old  lines  or  beginnings 
of  new  ones.  Since  an  occluded  neighboring  object  often 
contrasts  less  with  its  occluder  than  the  occluder  with  the 
background,  the  extension  of  a  contour  line  along  the  top  of  a 
TEE  junction  is  usually  a  boundary  line.  This  is  embodied  in 
heuristics  by  increasing  the  sensitivity  of  the  line  finder  when 
searching  for  the  extension.  Similarly  motivated  heuristics  are 
used  to  find  all  boundary  lines.  Finally,  the  sensitivity  is 
increased  even  more  to  find  internal  lines  at  likely  places  such 
as  the  stems  of  arrows. 

The  final  product  of  Shirai's  system  is  a  line  drawing  of  a 
scene,  with  lines  labelled  as  contours,  boundaries,  and  internal 
lines.  Individual  objects  (polyhedra)  are  identified  by  tracing 
contour  lines  and  their  extensions  into  boundary  lines;  this 
usually  yields  a  simple  closed  path  that  is  the  outline  of  an 
object.  The  selective  use  of  increased  line  finder  sensitivity 
in  places  determined  by  earlier  stronger  evidence  results  not 
only  in  considerable  computational  economy  but  also  in  the  avoid¬ 
ance  of  false  positive  line  detection  in  noisy  regions.  There 
are  two  underlying  principles  behind  Shirai's  approach.  The 
first  is  careful  exploitation  of  the  projective  semantics  which 
relate  straight  line  edges  in  a  scene  to  their  straight  line 
images .  The  other  is  the  constant  interplay  between  high  level 
knowledge  of  scene  properties  and  low  level  line  following. 

This  interplay  is  an  example  of  heterarchical  rather  than 
hierarchical  control.  The  term  heterarchy  was  used  by  McCulloch 
(BIBGEN  1945)  to  describe  neural  networks  in  which  feedback 
destroys  the  usual  distinction  between  high  and  low  level  control. 
In  "analogy  with  linear  systems  theory,  fev»dback  often  gives 
systems  greater  dynamic  range,  flexibility,  and  computation  power 
than  their  hierarchical  counterparts.  Minsky  (BIBTR  1970) 
suggested  that  heterarchical  control  might  be  valuable  in  AI 
systems.  This  contention  is  strongly  borne  out  by  Shirai's  system 
which  not  only  often  yields  far  better  line  drawings  with  less 
computation  than  conventional  bottom-up  appr^0ches  but  also 
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achieveii  rather  good  high  level  separation  of  objects  and  identi- 
fication  of  edge  types.  Remarkably,  this  higher  level  "semantic" 
type  of  information  is  acquired  without  the  use  of  tree  searches, 
dictionaries  of  junction  types,  or  projective  transformations 
characteristic  of  other  approaches.  These  omissions  prevent 
Shirai's  system  from  separating  neatly  stacked  bodies  and  detecting 
certain  kinds  of  concavity,  i.e.  situations  which  are  not  pointed 
to  by  the  images  of  convex  edges;  the  lack  of  3-0  semantics  is  a 
limitation.  However,  Shirai's  program  could  be  used  as  a  superior 
(to  conventional  line-finders)  preprocessor  for  bottom-up  programs, 
though  such  service  violates  the  spirit  of  heterarchyl 

This  completes  discussion  of  polyhedral  scene  analysis  based 
on  straight  lines  and  their  junctions.  Combinations  of  this 
approach  with  semantic  methods  embodying  knowledge  of  physical 
structure  and  purpose  will  be  discussed  after  describing  other 
methods  which  can  also  be  so  combined.  The  first  of  these  is 
region  analysis. 


2.3.2  Region  Analysis 

Minsky  and  Papert  (BIBGEN  1967)  suggested  avoiding  the  diffi¬ 
culties  of  line  finding  by  constructing  the  regions  that  might  be 
bounded  by  such  lines  directly,  joining  neighboring  cells  which 
have  similar  light  intensities.  Brice  and  Fenema  (BIBSA  1970) 
incorporated  this  suggestion  in  a  scene  analysis  system.  The 
first  step  is  to  scan  a  picture,  introducing  a  short  boundary 
between  any  two  neighboring  cells  whose  intensity  values  differ 
by  more  than  some  threshold.  The  magnitude  of  the  difference  in 
intensities  is  stored  in  association  with  each  such  boundary. 

In  terms  of  our  earlier  discussion  of  contrast  features,  this 
process  yields  first  order  differences,  analogous  to  first  order 
partial  derivatives  with  respect  to  the  coordinate  axes.  The 
resulting  boundaries  resemble  the  edge-elements  of  O' Gorman  and 
Clowes,  yielding  a  picture  with  many  short-perimeter  regions 
bounded  by  these  edge  elements.  Brice  and  Fenema 's  departure 
from  previous  contrast  detecting  methods  at  this  point  consists 
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in  merging  these  small  regions  according  to  heuristics  baaed  on 
qualities  of  regions  rather  than  line-fitting.  The  first  of 
these,  the  phagocyte  heuristic,  merges  adjacent  regions,  erasing 
their  common  boundary  if  the  contrast  along  it  is  low  and  the 
ratio  of  its  length  to  the  perimeter  of  the  shorter  region  is 
large.  The  result  is  a  less  choppy  picture  with  many  of  the 
former  small  enclosed  or  almost  enclosed  regions  being  merged 
with  larger  ones.  Next,  a  weakness  heuristic  is  applied, 
merging  regions  if  the  ratio  of  the  length  of  low  contrast 
common  boundary  to  length  of  common  boundary  is  high.  This 
differs  from  the  first  in  operating  without  regard  to  perimeter 
length.  Thus  the  phagocyte  heuristic  cleans  up  small  enclosed 
islands  which  are  unlikely  to  be  real  region  outlines  and  the 
weakness  heuristic  reduces  the  tendency  to  see  spurious  large 
region  outlines  where  intensity  gradients  are  present  but  weak. 

The  next  step  in  Brice  and  Fenema's  scheme  is  to  find 
vertices  (places  where  three  regions  meet)  and  join  adjacent 
ones  along  their  connecting  region-boundary  with  straight  line 
masks.  The  latter  are  thin  rectangles  anchored  at  vertices; 
their  widths  represent  limits  of  acceptable  deviation  of  boundary 
points  from  a  straight  line  joining  vertices.  After  successful 
fitting,  each  mask  is  replaced  by  a  line  and  the  picture  is 
represented  as  a  line  drawing.  Guzman-like  techniques  are 
applied  to  propose  objects,  which  are  assumed  to  be  wedges, 
cubes,  wall  or  floor.  The  last  two  constituents  are  also  sought 
on  the  basis  of  their  vertical  location  in  the  picture.  Line 
drawings  are  corrected  in  a  manner  similar  to  Falk's;  missing 
or  extra  lines  are  added  or  deleted  respectively  on  the  basis 
of  models  of  objects.  This  lack  of  object  generality,  is  not 
the  fault  of  region-growing  but  crudeness  of  higher-level 
processing.  This  is  one  of  the  first  attempts  at  region-growing 
and  predates  the  work  of  Waltz  and  Mackworth,  who  had  more 
sophisticated  high-level  representations. 

In  deriving  the  line  drawings  of  polyhedral  scenes,  region 
growing  and  line-finding  should  yield  the  same  result.  A  picture 
specified  by  a  line  drawing  can  just  as  well  be  specified  by 
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the  zegions  those  lines  enclose.  However,  region-growing  ought 
to  be  more  robust  than  line-finding  because  the  heuristics  which 
merge  regions  account  for  global  properties  of  intensity  distri¬ 
butions  throughout  a  region.  This  is  usually  a  much  larger  area 
than  the  small  neighborhoods  used  in  line-finding  so  local  high 
spatial  frequency  noise  is  much  less  disruptive. 

The  usefulness  of  the  heuristics  in  the  region-growing 
method  just  described  is  predicated  on  the  existence  of  rather 
large  regions  with  simple  boundaries,  not  necessarily  straight 
lines.  Line  fitting  was  the  result  of  a  later  pass  designed 
specifically  for  polyhedral  scenes.  An  attractive  feature  of 
region  growing  is  that  it  may  be  applied  to  more  natural  scenes 
whose  Images  contain  no  straight  lines.  Regions  may  correspond 
to  irregularly  shaped  areas  with  characteristics  distinct  from 
the  surround.  These  characteristics  need  not  be  restricted 
to  light  intensity  but  may  include  texture,  color  ,  or  binocular 
disparity  between  two  pictures.  The  characteristic  used  need  not 
have  constant  value  throughout  the  region  but  simply  be  free  from 
sharp  discontinuities.  Systems  embodying  general  region  analysis 
will  be  described  in  the  following  paragraphs. 

Krakauer  (BIBTR  1971)  embodied  a  unique  kind  of  region 
analysis  in  a  program  to  distinguish  between  various  fruits  such 
as  apples  and  pears.  This  task  is  representative  of  those  in 
which  humans  easily  recognize  objects  though  examples  within 
a  class  may  differ  in  ways  difficult  to  express  explicitly. 
Recognition  cues  seem  to  include  gross  2-D  image  shape,  3-D  shape 
and  reflective  properties  inferred  from  intensity  gradients  and 
roughness  or  texture.  All  of  these  are  incorporated  in  a  simple 
tree  structure  derived  from  an  intensity  contour  map.  The  tree's 
structure  can  be  visualized  by  considering  an  intensity  contour 
map  as  a  set  of  stacks  of  planar  regions  of  uniform  thickness, 
each  level  corresponding  to  picture  points  with  higher  light 
intensities  than  the  threshold  associated  with  that  level.  As 
examples,  a  uniformly  bright  disk  of  light  corresponds  to  a  stack 
of  poker  chips  and  a  disk  with  progressive  darkening  away  from 
its  center  corresponds  to  a  conical  stack.  Each  distinct 
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connected  region  (and  there  may  be  many  in  a  complex  picture)  at 
any  particular  level  corresponds  to  a  distinct  node  in  the  tree 
at  that  level.  If  one  region  supports  a  region  above  (that  is 
if  the  boundary  of  the  former  encloses  the  boundary  of  the  latter) 
a  path  connects  the  corresponding  nodes.  Thus,  the  tree  structure 
describes  set-nesting  relations  of  contour  curves.  A  picture 
of  a  "speckled”  object  corresponds  to  a  "bumpy"  contour  map  with 
many  disconnected  regions;  this  in  turn  corresponds  to  a  tree  with 
many  branches.  Krakauer  distinguishes  between  types  of  such 
texture  by  plotting  number  of  branches  against  tree  level.  This 
profile  has  characteristics  immune  to  geometric  transformation 
of  the  object  being  recognized,  changes  in  illumination,  and 
irrelevant  texture  details.  In  addition  to  the  structure  of  the 
tree,  each  node  can  carry  information  about  the  size  or  shape  of 
the  corresponding  region.  For  example,  pears  can  be  distinguished 
from  apples  by  the  eccentricity  or  elongation  of  regions  at  most 
levels.  The  advantage  of  this  kind  of  measure  is  that  statistics 
can  be  gathered  on  regions  at  fixed  levels  easily  even  if  their 
boundaries  are  not  "clean".  That  is,  ragged  edges,  holes,  and 
sharp  turns  need  not  be  eliminated  by  region  growing  heuristics 
in  order  to  measure  useful  properties  of  regions . 

Barrow  and  Popplestone  (BIBSA  1971)  used  region  analysis  in 
a  somewhat  more  general  manner  than  Krakauer  to  recognize 
pictures  of  household  objects  including  a  teacup,  hammer,  wedge, 
spectacles,  pencil  and  the  like.  Regions  are  found  using  a 
merging  algorithm  in  which  all  cells  in  any  particular  region 
must  have  gray-levels  within  a  narrow  range.  When  ranges  overlap, 
regions  may  overlap.  Instead  of  defining  a  tree  using  set-contain¬ 
ment  and  intensity  thresholds,  Barrow  and  Popplestone  define  a 
graph  which  expresses  more  general  relations  between  regions. 

Each  region  corresponds  to  a  node  and  links  between  nodes 
correspond  to  relations  between  regions  such  as  relative  position 
(above,  beside) ,  relative  size  (bigger) ,  distance,  and  shape  of 
adjoining  boundary.  In  addition,  each  node  carries  descriptive 
information  about  the  corresponding  region  such  as  shape  and 
brightness.  These  cues  were  subjectively  chosen  to  correspond 
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to  those  which  might  distinguish  common  objects  in  cartoon-like 
drawings;  such  drawings  resemble  the  result  of  region  growing 
programs  operating  on  gray-level  pictures. 

Recognition  in  the  Barrow  and  Popplestone  approach  consists 
in  matching  the  relational  graph  of  a  picture  with  that  of  the 
model  it  resembles.  This  involves  matching  not  only  graph  shape 
but  also  the  attributes  attached  to  nodes  and  links.  The  model 
comes  from  training  sessions  in  which  samples  of  known  objects 
are  processed  to  gather  statistics  on  the  resulting  graphs.  The 
system  performed  85%  correct  recognition  in  distinguishing  nine 
household  objects,  using  an  average  of  five  minutes  of  computer 
time  per  picture  on  an  ICL  4130. 

Rogion  analysis  and  other  approaches  can  be  combined  with 
semantic  methods  to  improve  scene  analysis.  These  topics  will  be 
discussed  immediately  after  the  next  section  which  deals  with 
numerical  approaches  exploiting  geometric  semantics. 


2.3.3  Numerical  Region  Analysis  Methods  Using  Geometric  Semantics 

Though  the  region  analysis  methods  just  described  allow  great¬ 
er  flexibility  in  scene  types  and  problem  solving  tools  than  line 
analysis  methods,  the  loss  of  projective  semantics  relating  scenes 
to  their  images  is  a  serious  drawback.  For  example,  a  shadow 
thrown  across  a  region  leads  to  its  erroneous  separation  into 
distinct  regions  in  the  graphical  description  of  the  picture. 
Occlusion  of  objects  by  others  has  similar  undesirable  results. 

For  this  reason  the  methods  of  Krakauer,  and  Barrow  and  Popplestone, 
fail  to  correctly  identify  objects  when  applied  to  scenes  with 
many  objects.  The  ad-hoc  heuristics  on  which  recognition  is 
based  are  not  easily  extended  co  account  for  such  projective 
phenomena.  An  approach  by  Horn  (BIBTR  1970)  analyzes  regions  but 
in  a  strictly  numerical,  geometric  way,  accounting  precisely  for 
the  projective  relations  between  object  and  image.  The  goal  is 
to  infer  3-D  shape  of  smooth  objects  from  gentle  (as  opposed  to 
edges)  gradations  in  light  intensity  (shading)  in  their  images. 

The  underlying  assumption  is  that  the  object's  surface  is 
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optically  uniform.  That  is,  in  approximating  any  small  area  of  a 
surface  by  its  tangent  plane,  the  relative  light  intensity  of  its 
image  is  a  function  of  the  relative  directions  of  its  normal  and 
lines  to  the  light  source  and  observer.  Different  models  of 
surface  optical  properties  lead  to  different  functions.  To 
invert  such  a  function  and  infer  the  3-D  surface  shape  requires 
solving  a  first-order  nonlinear  partial  differential  equation 
which  can  be  reduced  to  a  system  of  five  ordinary  differential 
equations.  Horn's  rigorous,  mathematical  approach  not  only  enables 
him  to  incorporate  heuristics  to  tune  the  parameters  of  numerical 
methods  to  solve  the  system  of  equations  efficiently,  but  also 
leads  to  interesting  observations  on  the  significance  of  shading 
in  human  vision.  The  roles  of  cosmetic  makeup  and  lighting 
techniques  in  photography  are  discussed  in  terms  of  the  mathemati¬ 
cal  model.  Horn's  approach  is  strictly  mathematical;  it  doesn’t 
produce  the  abstract,  relational  data  structures  usually  associated 
with  AI.  Just  as  in  Mackworth's  approach,  however,  such  data 
structures  can  be  derived  from  the  inferred  3-D  shape.  For 
example,  the  visual  boundaries  of  any  smooth  convex  object  are 
characterized  by  surface  normals  perpendicular  to  the  line  of 
sight;  thus  the  normals  could  be  used  to  identify  individual 
objects.  Contour  maps  of  3-D  depth  of  objects  should  provide  a 
more  reliable  base  for  analyzing  relations  between  picture  regions 
than  simple  intensity  contour  maps  because  the  former  are  invariant 
under  3-C  transformations  with  suitable  change  in  viewing  position 
alone,  while  the  latter  are  not.  Shadows  cast  by  objects  onto 
others  are  used  to  infer  shapes  and  relative  positions;  they  do 
not.  interfere  as  in  the  other  region  analysis  methods. 

High  contrast  loci  (edges  or  region  boundaries  in  earlier 
methods  discussed  earlier)  are  the  terminators  of  Horn's  solution 
paths  rather  than  being  the  objects  of  attention  as  in  line-finding 
methods.  In  Ramer's  approach  (BIBTR  1973  and  BIBPIC  1975)  they 
play  the  latter  role,  but  in  scenes  which  can  be  as  general  as 
Horn's.  This  suggests  fruitful  combination  of  the  two  methods 
for  identifying  (isolating)  distinct  objects  in  real  world  scenes 
and  inferring  their  3-D  shapes.  The  capabilities  of  the  two 
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methods  are  complementary.  Rainer  uses  Hueckel's  operator  (BIBPIC 
1971,1973)  to  detect  the  positions,  orientations,  and  strengths 
of  high-contrast  edges  in  a  picture.  He  uses  Freeman's  chain¬ 
encoding  to  approximate  their  orientations  and  links  short 
chains  into  longer  ones  using  heuristics  based  on  statistical 
confidence  criteria  such  as  strength,  length  of  chain,  local 
signal  to  noise  ratio,  and  directions  of  adjacent  neighbors. 

Long  chains  resulting  from  this  process  are  classified  as  shadow 
edges,  cracks,  texture,  boundaries  of  bright  specular  reflection 
(highlights)  and  object  boundaries  on  the  basis  of  constituent 
edge  properties  and  relations  to  other  chains.  For  example, 
shadow  edges  are  generally  less  sharp  than  boundaries  of  objects 
in  a  scene;  the  chains  corresponding  to  the  former  are  therefore 
characterized  by  lower  contrast  features  and  a  broader  range  of 
local  variation  in  direction  of  constituent  edges.  Resulting 
chains  are  fitted  to  quadratic  curves;  this  is  the  simplest 
computational  extension  of  linear  interpolation.  The  increase  in 
computational  complexity  yields  more  general  boundary  curve  detec¬ 
tion.  Quadratic  curves  are  particularly  important  because  they 
are  the  images  of  sections  of  quadric  surfaces  (spheres,  cylinders, 
ellipsoids,  cones,  etc.)  which  bound  objects  in  a  more  general 
class  than  polyhedra.  Methods  for  fitting  quadric  surfaces  to 
objects  are  described  by  Agin  (BIBTR  1972)  and  Agin  and  Biiford 
(BIBSA  1973).  In  a  sense,  Ramer's  approach  does  for  this  class 
of  objects  what  Shirai's  method  did  for  polyhedra;  namely,  it 
yields  a  classification  of  curved  edges  which  can  be  used  to 
determine  3-D  structure  of  objects  viewed.  The  tools  used  in 
these  two  cases  are  quite  different,  however. 

Another  purely  numerical  method  exploiting  the  projective 
semantics  relating  a  3-D  scene  to  its  2-d  image  is  used  by  Hannah 
(BIBTR  1974).  While  Horn's  method  may  be  characterized  as 
region-growing  based  on  extending  solution  paths  of  partial 
differential  equations,  Hannah's  method  consists  of  region 
growing  based  on  intensity  profile  correlations  between  two 
pictures  of  the  same  scene  taken  from  slightly  different  positions. 
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This  process  is  related  to  binocular  depth  perception.  In  the 
first  phase,  a  small  region  (window)  with  distinctive  statistical 
characteristics  (e.g.  large  gray-level  variance)  is  found  in  one 
picture.  Looking  only  at  parts  of  the  other  picture  with  the  same 
characteristics,  an  attempt  is  made  to  correlate  the  original 
window  with  a  window  in  the  other  picture.  When  this  succeeds, 
the  two  windows  are  considered  as  being  in  registration  with 
each  other,  i.e.  they  correspond  to  two  views  to  the  same  part 
of  the  scene.  A  region  is  grown  around  this  window  by  moving 
the  windows  in  both  pictures  the  same  distance  and  direction. 

When  an  abrupt  change  in  3-D  depth  occurs,  such  spreading  will 
fail  because  of  binocular  disparity,  a  shift  in  the  images 
relative  to  their  original  registration  positions.  Thus  each 
region  eventually  found  corresponds  to  a  surface  in  space  bounded 
by  abrupt  depth  changes.  Correlation  and  search  for  regions  in 
registration  with  each  other  is  extremely  expensive  computationally. 
Hannah  greatly  reduces  the  number  of  computation  steps  necessary 
by  exploiting  statistical  properties  of  the  picture,  using  fast 
algorithms  such  as  the  Past  Fourier  Transform,  and  restricting 
the  search  to  areas  deemed  likely  by  scene  and  image  geometry. 

The  latter  involves  employment  of  a  Camera  Model,  using  techniques 
developed  by  Sobel  (BIBTR  1970  and  BIBSA  1973,1974).  This  is 
a  mathematical  model  which  relates  camera  positions  to  image 
properties.  For  example,  an  image  point  in  one  picture  corres¬ 
ponds  to  a  point  in  three  space  which  could  be  located  anywhere 
on  a  line  of  sight  in  three  space.  The  image  of  this  line  of 
sight  in  a  second  picture  is  a  line  rather  than  a  point  if  the 
camera  is  not  in  the  same  position  in  both  cases.  In  searching 
the  second  picture  for  the  corresponding  image  of  a  point  in  the 
first  then,  one  need  only  look  along  this  line  even  if  the 
depth  of  the  point  in  the  scene  is  completely  unknown.  Guesses 
about  its  depth  correspond  to  restricting  the  search  to  segments 
of  this  line.  Thus,  the  fact  that  straight  lines  are  preserved 
as  a  class  under  projective  geometric  transformations  can  be 
exploited  to  great  advantage  even  when  there  are  no  straight 
lines  in  the  3-D  scene  or  its  images. 
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A  property  that  makes  Hannah's  approach  more  powerful  than 
any  discussed  so  far  is  that  her  region  analysis  is  not  impaired 
by  camouflage,  shading,  texture,  sharp  shadows  or  intensity  con¬ 
trast  edges  because  it  is  not  based  on  intensity  levels  and 
their  local  variations  but  on  correlations  between  arbitrarily 
structured  intensity  distributions  in  two  pictures.  She  has 
exploited  a  binocular  geometric  semantics  which  appears  to  be 
much  more  powerful  than  the  monocular  geometric  semantics  of 
other  methods.  We  will  now  examine  some  other  kinds  of  semantics 
used  in  scene  analysis. 


2.3.4  Incorporation  of  General  Semantics  and  Mechanical  Aids 

Feldman  and  Yakimovsky  (see  BIBTR  and  BIBSA,  both  authors, 
either  order  and  several  titles  relating  to  Semantic  Region  Grow¬ 
ing)  improved  the  results  of  region  growing  by  using  merging  rules 
based  on  semantic  or  (real  world)  phenomenon  in  the  scene.  The 
first  step  is  to  conservatively  apply  non- semantic  region  grow¬ 
ing  rules  with  merging  based  on  similarity  of  color  as  well  as 
light  intensity.  The  result  is  a  large  number  of  small  regions. 
Next,  semantic  meanings,  depending  on  the  problem  domain, 
are  attached  to  regions.  For  example,  in  a  typical  view 
through  the  windshield  of  a  car,  blue  regions  at  the  top  of  the 
picture  are  considered  sky;  they  are  merged  with  each  other  and 
while  (cloud)  regions  which  may  intervene.  At  the  bottom  of  a 
picture,  green,  yellow  and  brown  regions  are  interpreted  as 
grass  and  merged  together .  Regions  in  the  middle  of  the  road 
with  distinctive  shapes  are  identified  as  cars;  in  this  case 
merging  proceeds  as  long  as  the  geometric  shape  of  the  regions 
better  approximates  that  of  known  cars.  Similar  criteria  are 
evaluated  in  merging  regions  in  other  parts  of  the  picture.  The 
criteria  are  based  on  training  sessions  in  which  the  trainer 
evaluates  results  of  certain  merges.  In  the  actual  recognition 
process,  a  Bayesian  strategy  is  followed  to  maximize  the  proba¬ 
bility  of  making  the  best  choice.  The  end  result  is  a 
partitioning  of  the  picture  into  regions,  each  of  which  is 
labelled  according  to  its  meaning  in  the  real  world.  Regions 
are  identified  much  more  realistically  than  in  methods  where 
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semantics  are  not  employed  in  the  region  growing  rules. 

In  the  VISIONS  system,  Hanson  and  Riseman  (BIBTR  1974)  incor¬ 
porate  a  heterarchical  combination  of  high-level  (semantic) 
and  low-level  (geometric  feature  extraction)  methods  using 
parallel  computation.  At  the  semantic  level,  objects  which 
might  be  in  a  scene  are  modelled  by  abstract  relational  struc¬ 
tures  similar  to  those  of  Barrow  and  Popplestone  or  Yakimovsky. 

Low  and  high-level  processes  interact  in  the  following  manner. 
First,  low-level  processors  detect  features  which  suggest  a 
subset  of  objects  which  might  be  present  in  a  scene.  The  corres¬ 
ponding  semantic  models  are  selected;  they  direct  low-level 
processors  to  seek  features  which  would  confirm  or  deny  model 
validity.  In  the  case  of  denial,  new  features  found  could  be 
used  to  suggest  other  models,  and  so  forth.  The  low-level 
feature  detection  capabilities  are  very  flexible  and  include 
detection  of  local  color,  texture,  brightness,  and  contrast. 

The  selection  of  paticular  features  at  the  direction  of  high 
level  models  greatly  improves  efficiency  by  restricting  the 
search  to  relevant  information. 

Winston  (BIBTR  1370)  developed  a  system  which  learns  to 
recognize  complex  high-level  structures  constructed  of  blocks 
(polyhedra) .  Feature  extraction  and  line  detection  are  low-level 
processing  whereas  identifying  polyhedra,  as  in  the  programs  of 
Guzman  and  those  who  followed,  is  considered  high-level  proces¬ 
sing.  An  even  higher  level  description  of  a  scene  is  one  in 
which  the  blocks  form  structures  such  as  arches,  tables  and  towers. 
Such  structures  are  represented  in  Winston's  system  by  graph  data 
structures  whose  rnejor  nodes  correspond  to  substructures  and  links 
between  nodes  are  relations  such  as  is-supported-by  or 
is-a-part-of .  Additional  nodes  correspond  to  blocks  and  are 
linked  to  other  nodes  describing  their  attributes  (posture,  type 
of  polyhedron) .  Different  kinds  of  structures  have  different 
kinds  of  graphs,  but  many  times  one  type  of  structure  can  have 
variations  yielding  different  graphs.  The  goal  is  to  train  the 
system  to  distinguish  which  of  these  variations  are  crucial  in 
order  to  correctly  identify  structures  consisting  of  arrangements 
of  blocks.  It  is  accomplished  by  presenting  the  learning  program 
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with  "near  miss"  examples,  those  for  which  small  changes  transform 
one  kind  of  structure  into  another.  Thus  instead  of  using  a 
statistical  training  approach  which  would  blur  critical  distinc¬ 
tions,  the  latter  are  singled  out  for  special  attention. 

Winston's  approach  is  closedly  related  to  Minsky's  idea  of 
frames  (BIBTR  1974) ,  a  relational  data  structure  for  representing 
certain  global  situations  or  scenes.  The  frame  is  subject  to 
transformations  which  are  considered  to  be  minor  perturbations  of 
a  single  global  gestalt.  The  nature  of  the  transformations  is 
complex,  however,  involving  alterations  of  the  list  structure 
that  may  be  interdependent  and  difficult  to  anticipate.  The 
semantic  relations  transcend  those  of  the  real  world  in  AI.  This 
very  interesting  and  difficult  topic  lies  at  the  heart  of  much 
current  AI  research  but  is  beyond  the  intended  scope  of  this 
survey;  for  further  discussion  the  reader  is  referred  to  the 
general  AI  texts  in  BIBGEN,  to  Minsky,  Papert  and  Winston  (BIBTR, 
all  references  to  any  of  the  names)  and  in  particular  to  Winston 
(BIBSA  1975) . 

The  aim  of  scene  analysis  is  machine  comprehension  of  real 
world  scenes;  hopefully  under  natural  lighting  conditions.  Some¬ 
times,  however,  an  immediately  useful  application  is  desired,  for 
example  in  robot  assembly  of  parts  in  a  factory,  and  one  must 
fall  back  on  less  sophisticated  methods  for  determining  3-D 
configurations .  These  include  touch  sensors  attached  to  probes 
to  mechanically  measure  distance,  sonar,  and  artificial  lighting 
situations.  The  latter  exploit  highly  structured  lighting  which 
is  usually  used  to  infer  distance  by  triangulation  between  the 
known  position  and  direction  of  the  light  source,  position  and 
direction  of  the  camera,  and  the  image  of  the  light  on  an  object. 
Agin  and  Binford  (BIBSA  1973)  use  a  laser  in  such  a  system  at 
Stanford  University.  Shirai  and  Suva  (BIBSA  1971)  use  a  slit 
beam  for  range  finding  by  triangulation  in  scenes  with  polyhedra. 
Will  and  Pennington  (BIBSA  1971)  illuminate  a  scene  with  striped 
lighting  and  infer  planar  face  orientation  from  properties  of 
the  2-D  spatial  fourier  transform.  All  of  these  methods  represent 
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a  retreat  from  the  tradition  of  analyzing  natural  scenes  under 
natural  lighting.  In  part,  this  retreat  is  an  admission  of  the 
failure  of  attempts  to  correctly  analyze  any  but  the  most  arti¬ 
ficial  of  scenes.  Successful  scene  analysis,  as  modest  as  it  is, 
is  achieved  only  at  great  computational  expense.  Evidence  for 
the  latter  is  expressed  by  Smith  and  Coles  (BIBGEN  1973)  who  report 
that  analyzing  a  single  complete  scene  requires  ten  minutes  of 
central  processor  time  on  a  large,  high-powered  computer  at 
Stanford  University.  This  is  hopelessly  slow  for  a  practical 
robot  to  skillfully  navigate  a  fork-lift  loader,  l^t  alone  drive 
a  car.  In  the  next  chapter  possible  ways  of  overcoming  this 
frustrating  limitation  will  be  examined. 
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CHAPTER  3 .  OVERVIEW 


The  performance  of  artificial  visual  systems  is  far  below  even 
that  of  rather  primitive  natural  visual  systems.  Though  machine 
recognition  of  objects  could  be  improved  by  combining  several  of 
the  methods  discussed  previously,  the  result  would  be  a  cumber¬ 
some  system  requiring  large  amounts  of  computer  time  and  memory 
space.  The  expense  and  physical  size  of  such  a  system  could  cf 
course  be  significantly  reduced  by  the  currently  decreasing  cost 
and  size  of  computer  components.  However,  speed  cannot  be  signi¬ 
ficantly  improved  using  conventional  methods;  without  a  thousand¬ 
fold  speedup,  mobile  robots  which  interact  with  reasonable 
environments  are  impossible.  This  is  precisely  the  kind  of 
impasse  which  haunts  many  other  areas  ir  AI ;  methods  successful 
in  toy  worlds  do  not  generalize  effectively  to  real  ones  without 
drastic  reduction  in  performance.  The  combinatoric  explosion 
which  occurs  when  complexity  increases  in  generalizing  tc  real 
world  situations  is  often  combatted  by  embodying  semantics  of 
the  real  jrld  in  heuristics.  The  way  in  which  such  semantics 
are  represented  is  crucial  and  there  are  no  general  rules  icx 
choosing  either  the  semantics  or  their  best  representation.  This 
situation  applies  to  the  linguistic  approach  no  scene  analysis 
as  well.  There,  objects,  structures,  and  functional  aggregates 
of  objects  correspond  to  nodes  in  a  graph  whose  links  correspond 
to  relations  between  these  objects.  As  the  number  of  nodes 
increases  in  ge^'  alizing  to  more  complex  situations,  the  number 
of  po  ,-ui  1  ’  _  ..  increases  exponentially.  The  problem  cf 
deciding  which  links  are  relevant  becomes  increasingly  difficult 
and,  in  dynamic  situations,  relevance  may  change  in  noncbvious 
ways.  Minsky's  c-  nrept  of  frames  represents  an  attempt  to 
isolate  subparts  the  real  world  and  thereby  reduce  linkage 
complexity . 

The  problems  discussed  above  are  general  to  AI  and  it  appears, 
regrettably,  that  many  artificially  intelligent  systems  cannot 
be  significantly  generalized  from  their  toy  worlds.  Chandrasekaran 
and  Reeker  (BIBGEN  1972)  and  Dreyfus  (BIBGEN  1972)  discuss 


some  apparent  limitations  in  AI  research.  However,  in  scene 
analysis  it  appears  that  breakthroughs  can  be  realized  by  proper 
exploitation  of  geometric  semantics.  Since  the  phenomena  of 
projective  transformations  are  completely  expressible  mathemati¬ 
cally,  their  semantics  are  much  more  tractable  than  those  of 
other  Al  areas.  Any  mathematical  system,  however,  can  be 
expressed  in  a  large  number  of  different  ways  equivalent  formally 
but  enormously  different  in  terms  of  the  complexity  of  carrying 
out  certain  kinds  of  computations.  Roberts,  Mackworth,  Horn  and 
Hannah  explicitly  used  projective  semantics  but  in  completely 
different  representations,  and  for  different  purposes.  Is  there 
a  better,  mere  universal  representation?  Biological  examples 
may  help  answer  that  question.  Examination  of  biological 
processes  has  not  been  profitable  in  most  areas  of  AI  because 
very  little  is  known  about  neurophysiological  correlates  of 
conceptual  phenomena,  e.g.  words,  thoughts  or  logic.  In  vision 
however,  much  is  known  about  the  neurophysiology  and  its  low 
level  computational  capabilities.  Computational  geometry  is  a 
term  used  by  Minsky  and  Papert  (BIBSA  1969)  to  describe  the 
logical  mechanisms  which  are  used  to  deduce  geometric  properties 
in  quantized  spaces.  They  discussed  computations  for  topological 
geometry.  The  computational  geometry  of  some  affine  transforma¬ 
tions  is  discussed  in  Weiman  and  Rothstein  (BIBTR  1972) .  Lateral 
inhibition  is  a  good  example  of  biologically  observed  geometric 
computation.  This  is  the  inhibition  of  activity  of  neighboring 
nerve  cells  by  an  active  cell  in  a  network.  It  is  a  phenomenon 
common  to  many  sensory  nerve  networks;  in  vision  it  provides  a 
mechanism  for  smoothing  and  taking  spatial  derivatives  just  like 
the  finite  differencing  operators  mentioned  in  the  discussion  of 
contrast  features.  Marr  and  Pettigrew  (BIBTR  1973)  and  Marr 
(BIBTR  1974)  have  examined  more  subtle  geometric  computation 
capabilities  of  neurophysiological  structures  in  the  visual 
system. 

Some  of  the  problems  of  inferring  3-D  scenes  from  2-D  pictures 
involve  the  solution  of  partial  differential  equations  (PDE's) 
relating  geometric  quantities.  In  Horn's  approach  the  differential 
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quantities  were  components  of  tangents  to  surfaces  and  gradations 
in  image  intensity.  Though  Hannah's  method  does  not  use  PDE's 
explicitly,  the  binocular  disparities  used  as  a  basis  for  region 
analysis  are  angle  differentials  which  are  related  to  depth 
differentials.  In  fact,  the  relations  between  those  quantities 
have  been  expressed  in  terms  of  differential  geometry  by  Luneburg 
(BIBGEN  1947)  to  yield  some  rather  remarkable  and  counter-intuitive 
conclusions  about  our  binocular  perception  of  visual  space. 

A  closely  related  topic  that  is  virtually  unexplored  but  probably 
fruitful  is  projective  differential  geometry  (Wilczynski  BIBGEN 
1905) .  A  computational  advantage  of  representing  pictures  in 
discrete  space  (for  example  a  retina  or  a  digitized  picture)  is 
that  derivatives  of  various  orders  correspond  to  finite  differences 
and  the  operations  of  calculus  are  easily  approximated  by  simple 
arithmetic  operations  as  we  saw  in  the  discussion  of  contrast 
detectors.  This  fact  is  exploited  in  numerical  analysis  using 
grids  for  the  numerical  solution  of  PDE's  where  the  analytic 
solution  is  not  known  or  intractable.  These  numerical  solutions 
are  usually  solved  sequentially,  one  cell  at  a  time.  In  the 
biological  case,  many  nerve  cells  are  active  simultaneously. 

This  kind  of  parallel  processing  gives  biological  organisms  the 
capability  of  reacting  much  faster  than  artificial  visual 
systems  despite  the  fact  that  biological  components  are  much 
slower.  Parallel  computation  cannot  only  increase  speed  but 
often  completely  changes  the  expression  of  algorithms  (Traub, 

BIBGEN  1973) .  In  many  cases,  complex  sequential  algorithms 
can  be  expressed  in  terms  of  a  large  number  of  simple,  cooperat¬ 
ing,  simultaneously  active  (parallel)  algorithms.  This  is 
particularly  relevant  to  the  solution  of  PDE'3;  algebraically 
complicated  global  functions  are  often  the  solutions  of  simple 
differential  (local)  equations.  Recently  formal  models  of 
biological  developmental  processes  have  been  generalized  and 
studied  in  the  theory  of  formal  languages.  Called  L-systems 
after  the  biologist  Lindenmayer  who  originated  them,  these 
systems  are  important  to  the  theory  of  parallel  computation 


as  well  as  biological  models  (Herman  and  Rozenberg  BIBGEN  1975) . 
Horn  (BIBP1C  1974)  has  recently  incorporated  parallel,  discrete 
algorithms  in  a  system  for  inferring  illumination  of  surfaces 
from  image  intensities.  Since  image  intensity  is  a  function  of 
surface  orientation  and  illumination,  this  problem  is  closely 
related  to  his  Ph.D.  thesis  topic  discussed  earlier.  Remarkably, 
his  approach  explains  perceptual  phenomena  previously  poorly 
modelled  and  points  to  actual  neurophysiological  components  of 
the  mammalian  retina  which  could  be  carrying  out  the  parallel 
algorithms . 

The  parallel,  numerical,  PDE  approaches  just  discussed  in  a 
sense  deal  with  what  could  be  called  lower  level  data  in  vision. 
That  is,  they  show  how  3-D  depth  contours  can  be  inferred  but  do 
not  deal  directly  with  the  higher  concepts  of  shape  recognition, 
isolation  of  distinct  bodies,  and  determination  of  relations 
between  objects.  Global  abstractions  are  considered  in  gestalt 
psychology  as  being  the  result  of  associating  parts  on  the  basis 
of  something  they  have  in  common  (Haber  <and  Hershenson,  BIBGEN 
1973) .  This  includes  grouping  parts  that  are  geometrically  close 
to  each  other  and  grouping  together  parts  that  are  similar  to 
each  other.  Lester  (BIBTR  1974  and  1975)  has  quantified  the 
concepts  of  proximity  and  similarity  in  programs  which  group 
together  very  general  kinds  of  genetric  objects.  Though  he  only 
considered  2-d  pictures,  application  of  this  kind  of  idea  to 
inferred  3-D  structures  could  be  useful.  Hannah's  approach 
implicitly  joins  regions  on  the  basis  of  lateral  and  depth 
proximity  while  other  region  growing  methods  use  a  combination  of 
similarity  and  proximity.  If  one  generalizes  "common  fate"  gestalt 
grouping  concepts  to  include  "common  transformation"  the  role 
of  motion  in  vision  becomes  very  important.  This  is  an  area 
which  has  i^en  virtually  ignored  in  artificial  vision  which 
usually  concentrates  on  analyzing  form  in  static  pictures  and 
regards  motion  as  an  interpolation  between  two  static  pictures. 

This  ignores  completely  the  fact  that  motion  detectors  are 
present  in  large  numbers  in  all  mammallian  visual  nervous 
systems  and  that  when  images  are  stabilized  on  the  human  retina, 
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vision  ceases.  The  roles  usually  ascribed  to  motion  detectors 
are  providing  feedback  for  eye  movements  and  attracting  the  eye 
to  parts  of  a  scene  which  nay  move.  Though  there  is  certainly 
great  adaptive  value  in  these  roles,  motion  detectors  could  also 
play  a  fundamental  part  in  the  perception  of  geometric  form  and 
inferring  3-D  shape.  Platt  (BIBGEN  1960)  has  shown  how  null 
responses  from  motion  detectors  when  a  pattern  is  moved  on  a 
retina  can  be  used  to  recognize  straight  lines  and  circles  as  a 
result  of  self-congruence  properties  of  such  figures  under  certain 
transformations.  These  particular  figures  are  of  central  impor¬ 
tance  in  projective  geometry.  Generalizing  this  motion  to  three 
dimensions,  Johannsen  (BIBGEN  1975)  presents  strong  experimental 
evidence  that  human  assumption  of  3-D  self-congruence  of  objects 
in  a  scene  plays  a  powerful  role  in  interpreting  motion  visible 
only  on  a  2-D  display.  This  characteristic  is  exploited  in  the 
kinetic  depth  effect  in  computer  graphics.  When  a  2-D  image  is 
transformed  as  it  would  be  if  the  rigid  3-D  object  it  represents 
were  rotated  in  space,  the  image  immediately  "looks"  3-D. 

(Newman  and  Sproull  BIBGEN  1973) .  Human  perception  of  moving 
objects  does  not  appear  to  be  a  succession  of  static  frames;  in 
real  life  stagecoach  wheels  do  not  appear  to  rotate  backwards 
due  to  frame  strobing  as  in  motion  pictures. 

In  summary,  perceiving  motion  directly  could  provide  a  method 
for  separating  the  effects  of  projective  transformations  and  3-D 
motions.  Once  these  have  been  isolated,  scene  organization  can 
be  based  on  "common  3-D  motion"  as  a  gestalt  grouping  mechanism. 
For  example,  a  still  picture  of  a  flock  of  brown  birds  flying 
together  between  the  branches  of  a  large  brown  tree  would  be  very 
difficult  to  analyze  purely  by  the  methods  of  either  region  analy¬ 
sis,  edge  analysis  or  binocular  disparity  analysis.  There  are 
too  many  significant  small  regions  and  sharp  edges  to  analyze, 
and  many  weak  edges  separate  tree  branches  from  birds.  A  motion 
detecting  method,  however,  would  group  together  all  edges  sharing 
the  same  speed  and  direction.  This  idea  is  very  much  in  keeping 
with  Gibson's  (BIBGEN  1966)  who  proposes  that  sensory  systems 
measure  invariance  of  certain  stimulus  properties  under  transfor¬ 
mations  rather  than  properties  in  isolation.  Mechanisms  for 
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detecting  such  invariances  could  be  incorporated  in  integral 
geometry  schemes.  These  are  generalizations  of  the  Buff on  needle 
problem  from  statistics  and  probability  and  have  the  advantage 
that  geometric  properties  can  be  measured  to  any  desired  accuracy 
independent  of  coordinate  systems  using  statistical  inference 
(Novikoff  BIBGEN  1962) .  A  pool  of  edge  detectors  would  be  an 
ideal  candidate  for  embodying  such  mechanisms.  Combining 
the  ideas  of  this  paragraph  with  the  discussion  of  PDE's  in 
analyzing  static  scenes,  motion  is  the  derivative  of  position. 
Contrast  feature  detectors  linked  by  delays  can  be  used  as  motion 
detectors.  Relative  motion  of  eye  and  image  reduce  motion  detec¬ 
tion  and  edge  detection  to  the  same  kind  of  computation.  Now,  the 
systems  of  PDE's  for  static  picture  analysis  are  augmented  by 
equations  in  which  derivatives  with  respect  to  time  are  involved. 
This  added  constraint  and  the  integral  relation  between  motion 
and  distance  ought  to  be  explored  for  possible  embodiment  in 
artificial  visual  systems. 

The  fact  that  the  new  approaches  suggested  in  the  preceding 
paragraphs  do  not  strongly  resemble  traditional  approaches  is  not 
meant  to  discredit  the  latter.  All  of  them  have  elements  which 
are  important  stepping  stones  incorporated  into  the  new  sugges¬ 
tions.  Linguistic,  traditionally  AI  methods  have  not  been 
greatly  discussed  because  I  feel  there  are  many  geometric 
semantic  problems  which  can  be  solved  to  put  us  far  ahead  of 
where  we  are  now;  at  that  time  the  conceptual,  semantic  structures 
such  as  Minsky's  and  Winston's  will  be  even  more  important  than 
they  are  now.  Vision  and  intelligence  can  be  treated  separately, 
though  intelligent  vision  is  more  powerful  than  vision  alone. 

For  example,  the  lowly  housefly  has  excellent  vision  which  is 
used  to  navigate  at  high  speed,  avoiding  collision  in  environ¬ 
ments  containing  far  more  complex  objects  and  lighting  extremes 
than  those  current  robot  vision  systems  maneuver  through.  It  is 
doubtful  that  high  level  concepts  and  abstractions  are  involved. 

It  appears  quite  feasible  to  build  a  system  with  similar  capabili¬ 
ties.  If  in  addition  we  incorporate  knowledge  of  real  world  objects 
and  relations  into  such  a  system,  so  much  the  better,  but  the 
latter  is  not  necessary  for  excellent  machine  vision. 


IV. 1-48 

»  ' 


/ 


CHAPTER  4.  DESCRIPTION  OF  BIBLIOGRAPHY 


The  bibliography  is  divided  into  four  parts  whose  names  are 
also  the  names  of  indirect  access  permanent  files  on  the  CDC  6600 
at  the  Courant  Institute.  The  first  three  letters  of  the  names 
of  these  files  are  always  BIB;  remaining  letters  are  acronyms  for 
the  type  of  contents  of  each  file.  The  format  of  these  files  is 
such  that  copies  onto  Hollerith  cards  yield  symbols  compatible 
with  most  computer  systems,  permitting  portability.  Each  biblio¬ 
graphic  reference  is  formatted  for  easy  visual  perusal  and 
elementary  information  retrieval  or  updating  using  string 
processing  languages  or  standard  text  editors.  To  simplify  these 
tasks,  all  references  are  in  a  standard  form  that  encompasses  the 
wide  diversity  of  publication  types  which  range  from  disserta¬ 
tions  to  textbooks.  In  this  standard  form,  each  reference 
occupies  three  lines.  The  first  three  or  four  columns  of  each 
line  are  reserved  for  keys,  handy  links  for  infoimation  retrieval 
or  quick  visual  reference.  The  first  line  of  each  entry  contains 
the  author's  name(s),  last  name  first  followed  by  initials  followed 
by  the  date  of  publication  in  parentheses  (see  BIBGIDE  file  which 
follows  this  section  for  other  format  details) .  References  in  the 
text  are  given  by  author (s)  followed  by  the  file  name  and  date. 

The  division  of  the  bibliography  into  four  files  is  based  on 
document  accessibility  and  topic.  The  file  BIBSA  contains  refer¬ 
ences  closely  related  to  scene  analysis  (hence  SA  in  its  acronym) 
and  publically  available  in  journals  (key  JP)  books  and  conference 
proceedings.  The  key  JP  stands  for  journal  paper,  in  which  case 
the  third  line  (key  S  for  source)  gives  the  name  of  the  journal, 
volume  number,  and  pages  in  that  order.  The  journals  Artificial 
Intelligence  (American  Elsevier,  New  York  or  North-Holland, 
Amsterdam)  and  computer  Graphics  and  image  Processing  (Academic 
Press)  contain  the  bulk  of  scene  analysis  literature  in  this  form. 
Another  important  source  are  in  the  set  of  Proceedings  of  the 
International  Joint  Conferences  on  Artificial  Intelligence  refer¬ 
red  to  as  PIJCAI  in  the  bibliography  and  described  at  the  bottom 
of  the  file  BIBSA.  Also  important  is  the  Machine  Intelligence 
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series,  proceedings  of  international  workshops  in  AI  held  in 
Edinburgh,  Scotland.  Information  about  this  collection  is  also 
listed  at  the  bottom  of  the  file;  references  to  it  in  the  biblio¬ 
graphy  are  abbreviated  as  MI  in  the  third  or  source  line.  Often 
papers  in  both  these  collections  are  reworked  and  published  in 
the  journal  Artificial  Intelligence  as  can  be  deduced  by  looking 
at  authors  and  titles;  this  redundancy  is  useful  if  one  of  the 
sources  is  unavailable.  Finally,  these  collections  also  contain 
descriptions  of  robots  which  incorporate  scene  analysis  and 
other  methods  in  problem  solving  systems . 

Two  books  (key  BK  in  the  second  line  of  a  reference)  should  be 
singled  out  for  special  attention  in  BIBSA..  The  first,  a  textbook 
by  Duda  and  Hart  (BIBSA  1973)  gives  comprehensive  coverage  of 
the  areas  in  its  title,  excellent  overviews,  and  good  biblio¬ 
graphies.  The  second  is  Winston  (BIBSA  1975)  which  could  be 
subtitled  "Scene  Analysis  at  MIT."  The  coverage  is  not  compre¬ 
hensive  over  the  field  of  scene  analysis  but  contains  versions 
of  Minsky's  paper  on  the  concept  of  Frames,  and  Horn's  and  Waltz's 
dissertations;  previously  these  were  only  available  as  technical 
reports . 

The  file  BIBPIC  contains  a  small  selection  of  references  in 
picture  processing  methods  which  were  precursors  of  or  are 
connected  with  scene  analysis.  Most  important  are  the  surveys 
by  Rosenfeld  which  lead  to  further  references  in  picture  proces¬ 
sing  in  a  comprehensive  and  clearly  presented  way. 

BIBGEN  contains  general  references  in  mathematics,  biology, 
and  AI  related  to  the  discussion  in  Chapters  1  and  3  primarily. 

The  Handbook  of  Sensory  Physiology,  Volume  VII  (the  first  entry 
in  BIBGEN)  is  the  most  complete  source  on  the  neurophysiology 
of  vision  and  can  lead  the  reader  to  further  references  in  that 
area.  Most  of  the  titles  of  other  references  in  BIBGEN  are  self- 
explanatory. 

BIBTR  lists  technical  reports,  dissertations  and  memos  (keys 
TR,  DI  and  ME,  respectively,  on  the  title  line)  and  similar 
documents  related  to  scene  analysis  but  not  easily  accessible. 
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The  source  line  (key  S)  lists  the  institution,  department,  degree 
if  dissertation,  and  report  number  if  any,  in  that  order.  Most 
of  the  work  listed  in  BIBTR  was  carried  out  at  the  AI  labs  at  MIT 
or  Stanford  University,  California.  These  two  institutions  have 
strong  professional  interconnections  and  in  the  past  have 
received  considerable  federal  funding  for  AI  research.  Hence 
dissertations  usually  become  technical  reports,  the  most  accessi¬ 
ble  type  of  document  in  BIBTR.  Even  if  the  technical  reports 
are  unavailable,  the  dissertation  titles  give  the  reader  an  idea 
of  the  major  research  interests  of  their  authors;  seeing  the 
names  later  in  journal  articles  can  lead  to  a  good  guess  about 
the  topic  of  an  article.  In  addition,  revised  versions  of 
papers  listed  in  BIBTR  often  find  their  way  into  those  listed 
in  BIBSA;  thus,  to  find  a  more  publically  accessible  form  of 
a  paper  one  need  only  match  authors  and  seek  similar  titles  in 
BIBSA. 

The  least  accessible  documents  in  BIBTR  are  memos,  papers 
intended  only  for  internal  circulation  within  the  source  insti¬ 
tution.  Those  of  the  artificial  vision  research  group  at  MIT 
are  often  called  Vision  Flashes  (abbreviated  VF  in  the  bi jlio- 
graphy) ;  other  MIT  memos  in  AI  are  often  called  Working  Papers 
(abbreviated  WP  in  the  bibliography) . 

For  information  on  obtaining  the  bibliography  files  as  a  deck 
of  Hollerith  cards,  write  to  Malcolm  Harrison,  the  principal 
investigator  of  the  sponsoring  grant,  at  the  address  on  the  title 
page  of  this  report.  In  the  listing  of  the  deck  which  follows 
each  single  quotation  mark  (*)  comes  out  as  a  not-equal  sign  (/) ; 
the  former  was  removed  from  the  printer  chain  at  the  Courant 
Institute  to  make  room  for  special  symbols.  The  card  code  is  a 
4-8  punch  which  is  interpreted  as  a  single  quotation  mark  by 
most  systems. 
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SIBGIDEt  GUIDE  TO  FILES  FOR  SCENE  ANALYSIS  SURVEY  BIBLIOGRAPHY. 


BIBGEN  -  BIBLIOGRAPHY  OF  GENERAL  TOPICS  SUCH  AS  HATH  AND 
NEUROPHYSIOLOGY  RELATED  TO  THIS  SURVEY 

BIBPIC  -  BIBLIOGRAPHY  OF  PICTURE  PROCESSING  AND  EDGE  DETECTION 

BIBS A  -  BIBLIOGRAPHY  OF  EXPLICIT  SCENE  ANALYSIS  PAPERS  AND 

BOCKS*  PU6LICALLY  MARKETED  (JOURNALS  AND  PUBLISHERS) 

BI8TR  -  BIBLIOGRAPHY  OF  TECHNICAL  REPORTS  AND  SIMILARLY  HARD 
TO  GET  DOCUMENTS  CLOSELY  RELATED  TO  SCENE  AHALYSIS 

1)  IN  ANY  FILE  EACH  REFERENCE  CONSISTS  OF  THREE  LINES;  THE  FIRST 

ONE  OR  TWO  COLUMNS  OF  EACH  LINE  HOLD  A  KEY  WHICH  REFERS  TO  THE  KIND 

OF  INFORMATION  CONTAINED  ON  THAT  LINE.  THE  FIRST  LINE  ALWAYS  CONTAINS 
THE  AUTHORWS  NAME ( S ) *  LAST  NAME  FIRST  STARTING  IN  COLUMN  FOUR* 

FOLLOWED  BY  INITIALS  ANO  THEN  THE  YEAR  OF  PUBLICATION  IN  PARENTHESES; 
THE  KEY  TO  THE  FIRST  LINE  IS  WA*  (FOR  AUTHOR)  IN  COLUMN  1. 

2)  THE  SECOND  LINE  IS  ALWAYS  THE  TITLE  ENCLOSED  IN  SINGLE  QUOTES 

STARTING  IN  COLUMN  SIX,  BUT  THE  TwQ  LETTER  KEY  IN  COLUMNS  ONE  ANO 
TWO  GIVE  THE  TYPE  OF  PUBLICATION  AS  FOLLOWS: 


BK 

FOR 

BOOK, 

CF 

FOR  CONFERENCE  PROCEEDINGS  (UNLESS  PUBLISHED  AS  A  BOOK 
WHICH  IS  PUBLICALLY  MARKETED) 

01 

FOR 

A  DISSERTATION  OR  THESIS 

IN 

FDR 

A  PAPER  IN  SOME  OTHER  WORK 

JP 

FOR 

JOURNAL  PAPER 

ME 

FOR 

MEMO 

TR 

FOR 

TECHNICAL  REPORT 

3)  THE  THIRD  LINE  GIVES  THE  PUBLISHING  SOURCE  (KEY  S  IN  COLUMN 
ONE)  OF  A  WORK  STARTING  IN  COLUMN  SIX.  EXAMPLES  OF  THIS  ARE 
THE  PUBLISHER  FOR  A  BOOK,  THE  NAME  OF  A  JOURNAL*  THE  INSTITUTION 
FOR  A  TECHNICAL  REPORT  OR  THESIS,  THE  AUTHOR ( E D I  TOR )  AND  TITLE  FOR 
A  COLLECTION  A  PAPER  APPEARS  IN. 

WHERE  CONTINUATION  LINES  ARE  NECESSARY,  THEY  APPEAR  WITHOUT  KEYS. 
A  eLANK  LINE  FOLLOWS  EACH  REFERENCE.  SEE  MCOEIS  BELOW : 

A  LASTNAM1, I .  J .,  LASTNAM2,K.L „  AND  LASTNAM3, M.N.  (YEAR) 

JP  WTITLE  OF  PAPER  A 

S  JOURNAL  TITLE,  VOL  3,  PAGES  23A-3A5. 

A  LASTNAM, I . J .  (YEAR) 

TR  WTITLE  OF  TECHNICAL  REPORT** 

S  INSTITUTION,  DEPARTMENT,  LOCATION,  TECH  REPORT  NUMBER. 
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BIBfiENl  GENERAL  REFERENCES  IN  MATH*  BIOLOGY*  NEUROPHYSIOLOGY* 

ARTIFICIAL  INTELLIGENCE*  PSYCHOLOGY,  AND  PHILOSOPHY. 

******************* I************************************* *«*«**+ 

A  ASSORTED  EDITORS  (1972  AND  LATER  FOk  VARIOUS  PARTS) 

8K  *HANDBOOK  OF  SENSORY  PHYSIOLOGY**  VOLUME  VII*  VISION 
S  SPRINGER-VERLAG*  BERLIN  AND  NEW  YORK. 

A  BUSEMANN*  A.  AND  KELLY*  P.J.  (1953) 

BK  WPROJECTIVE  GEOMETRY  AND  PROJECTIVE  METRICS* 

S  ACADEMIC  PRESS*  NEW  YORK. 

A  CHANDRASEKARAN*  B.  AND  REEKER*  L.H.  (1972) 

TR  WARTIFIC IAL  INTELLIGENCE-  A  CASE  FOR  AGNOSTICISM* 

S  OHIO  STATE  UNIV*  COMP  AND  INFO  SCI  DEPT,  C I SRC-TR-72-9 . 

A  COXETER*  H.S.M.  (196A) 

BK  *PRO JECT I VE  GEOMETRY* 

S  BLAISDELL  PUB  CO*  NEW  YORK. 

A  OEUTSCH,  S.  (1966) 

JP  *CON JECTURES  ON  MAMMALIAN  NEURON  NETWORKS  FOR  VISUaL 
PATTERN  RECOGNITION* 

S  TRANS  IEEE,  SSC2,  DECEMBER,  PAGE  81. 

A  DODWELL*  P.C.  (1970) 

BK  *  VISUAL  PATTERN  RECOGNITION* 

S  HOLT*  RINEHART,  AND  WINSTON*  NEw  YORK. 

A  DREYFUS*  H.L.  (1972) 

BK  *WHAT  COMPUTERS  C AN*  T  DQ  -  A  CRITIQUE  OF  ARTIFICIAL  REASON* 

S  HARPER  AND  ROW,  NEW  YORK. 

A  FIRSCHEIN*  0.,  FI5CHLER*  M.A.,  COLES*  L.S.  AND  TENENBAUM,  J.M.  (1973) 
IN  WFQRECAS  T ING  ANO  ASSESSING  THE  IMPACT  OF  AI  ON  SUCIETY* 

S  PIJCAI  3,  STANFORD*  PAGES  105-120. 

A  GIBSON*  J.J.  (1966) 

BK  *  THE  SENSES  CONSIDERED  AS  PERCEPTUAL  SYSTEMS* 

S  HOUGHTQN-MI F  FL IN ,  BCSTON. 

A  HABER,  R.N.  AND  HERSHENSON*  M.  (1973) 

BK  *THE  PSYCHOLOGY  OF  VISUAL  PERCEPTION* 

S  HOLT*  RINEHART,  AND  WINSTON,  NEW  YORK. 

A  HERMAN*  G.T.  AND  ROZENBERG*  G.  (1975) 

BK  WDEVELOPMENTAL  SYSTEMS  AND  LANGUAGES* 

S  NORTH-HGLL AND,  AMSTERDAM. 
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A  HEWITT#  C.  (1971) 

IN  *PRGC6DUR AI  EMBEDDING  OF  KNOWLEDGE  IN  PLANNER* 

$  PIJCAI  2,  LONOGN#  PAGES  167-189 

A  JGHANNSON#  G.  (1975) 

JP  *VISUAL  MOTION  AND  PERCEPTION* 

S  SCIENTIFIC  AMERICAN#  VOL  232#  JUNE#  PAGES  76-68. 

A  JULESZ#  R.  (1971) 

BK  *FOUND ATIONS  OF  CYCLOPEAN  PERCEPTION* 

S  UNIVERSITY  OF  CHICAGO  PRESS#  CHICAGO#  ILL. 

A  L  LINE  BURG#  R.K.  (1997) 

BK  *MATHEMAT ICAL  ANALYSIS  OF  BINOCULAR  VISION* 

S  DARTMOUTH  EYE  INSTITUTE#  HANOVER#  NEW  HAMPSHIRE. 

A  MCCARTHY#  J.  ANO  HAYES#  P.J.  (1969) 

IN  *SQME  PHILOSOPHICAL  PROBLEMS  FROM  THE  STANDPOINT  OF  AI* 

S  MI  9#  PAGES  963-502. 

A  MCCULLOCH#  W.S.  (1995) 

JP  * A  HETERARCHY  OF  VALUES  DETERMINED  BY  THE  TOPOLOGY  QF  NERVOUS  NETS* 

S  BULLETIN  OF  MATHEMATICAL  BIOPHYSICS#  VOL  7#  PAGES  89-i3. 

A  MCCULLOCH,  W.S.  '1965) 

BK  WEM60DIMENT3  Of  MINO* 

S  MIT  PRESS#  CAMBRIDGE. MASS. 

A  MINSKY#  M.  AND  PAPERT,  S.  (1967) 

BK  *PROJECT  MAC  PROGRESS  REPORT  IV* 

S  MIT  PRES5#  CAMBRIDGE#  MASS. 

A  NEWMAN#  W.M.  AND  5PR0ULL#  R.F.  (1973) 

BK  *PRINC IPLES  OF  INTERACTIVE  COMPUTER  GRAPHICS* 

S  MCGRAW-HILL#  NEW  YORK. 

A  NILSSON#  N • J .  (1971) 

BK  *PROBLEM  SOLVING  METHQOS  IN  AI* 

S  MCGRAW-HILL#  NEW  YORK. 

A  NOVIKOFF#  A.  (1962) 

IN  *INTEGRAL  GEOMETRY  AS  A  TOOL  IN  PATTERN  PERCEPTION* 

S  VON  FOERSTER  AND  ZOPF  (EDS)#  *PRINCIPLES  OF  SELF-ORGANIZATION* 
PtRGAMQN  PRESS#  MACMILLAN#  NEW  YORK#  PAGES  397-368. 

A  PLATT#  J.R.  (1960) 

JP  *HGW  WE  SEE  STRAIGHT  LINES* 

S  SCIENTIFIC  AMERICAN,  VOL  202#  JUNE#  PAGES  121-129. 

A  RATTLIFF#  F.  (1965) 

BK  *MACH  BANOS*  QUANTITATIVE  STUOIES  ON  NEURAL  NETWORKS  IN  THE  RETINA* 
S  HOLDEN-DAY#  SAN  FRANCISCO. 


A  REISS*  R.F.  {EDITOR)  (1964) 

6K  #NEURAL  THEORY  AND  MODELLING# 

S  STANFORD  UNIV  PRESS*  STANFORD*  CALIF. 

A  ROTHSTEIN*  J.  (1956) 

8K  ^INFORMATION*  LOGIC  AND  PHYSICS# 

S  PHILOSOPHY  OF  SCIENCE*  VOL  23*  PAGES  31-35. 

A  SMITH*  M.H.  AND  COLES*  L.S.  (1973) 

IN  #DES IGN  OF  A  LOW  COST*  GENERAL  PURPOSE  ROBOT# 

S  PIJCAI  3*  STANFORD*  PAGES  324-335. 

A  SPRINGER*  C.E.  (1964) 

BK  #GEOMETRY  AND  ANALYSIS  OF  PROJECTIVE  SPACES# 

S  W.  H.  FREEMAN*  SAN  FRANCISCO. 

A  TRAU8*  J.F.  (1973) 

BK  #CQMPLEXITY  OF  SEQUENTIAL  AND  PARALLEL  NUMERICAL  ALGORITHMS# 

S  ACADEMIC  PRESS*  NEW  YORK. 

A  TURING*  A.M.  (1947) 

IN  #INTELL IGENT  MACHINERY# 

S  MI  5*  1970,  PAGES  3-23. 

A  VON  SENDEN*  M.  (1960) 

BK  #SPACE  AND  SIGHT# 

S  THE  FREE  PRESS*  GLENCOE*  ILLINOIS. 

A  WILCZYNSK1,  E.J.  (1905) 

BK  #PROJECTIVE  DIFFERENTIAL  GEOMETRY  OF  CURVES  AND  RULED  SURFACES# 

S  CHELSEA*  NEW  YORK,  (1961  REPRINT  OF  1905  TEUBNER  PUBLICATION). 

A  WINOGRAO*  T.  (1972) 

BK  #UNDERSTANDING  NATURAL  LANGUAGE# 

S  ACADEMIC  PRESS*  NEW  YORK. 

*********+♦♦♦***+#*****#*****##***♦♦***********+♦#*#****#*#♦**#*##♦♦♦ 
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BIBP IC i  BIBLIOGRAPHY  OF  PAPERS*  BOOKS*  AND  CONFERENCES  ON 

PICTURE  PROCESSING*  PATTERN  RECOGNITION  AND  EDGE  DETECTION. 

♦♦♦♦♦A******* * ******* **** ♦+ *** ###*#*#♦******* ft********************* 

A  DAVIS*  L.  S.  (1975) 

JP  *A  SURVEY  OF  EOGE  DETECTION  TECHNIQUES* 

S  COMPUTER  GRAPHICS  ANO  IMAGE  PROCESSING*  VOL  *,  PAGES  2*0-270. 

A  DUOA* R .0 •  ANO  HART* P . E .*( 1972 ) 

JP  MUSE  OF  THE  HOUGH  TRANSFORM  TO  DETECT  LINES  ANO  CURVES  IN 
PICTURES* 

S  CACM*  VOL  15*  PAGES  11-15. 

A  FREEMAN*  H.  (197*) 

JP  ^COMPUTER  PROCESSING  OF  LINE  DRAWING  IMAGES* 

S  ACM  COMPUTING  SURVEYS*  VOL  6*  PAGES  57-97. 

A  GERAROIN*  L.A.  AND  FLAMENT*  J.  (1969) 

IN  ^GEOMETRICAL  PATTERN  FEATURE  EXTRACTION  BY  PROJECTION  ON 
HAAR  ORTHONORMAL  BASIS* 

S  PIJCA1  1*  WASH  DC*  PAGES  65-78. 

A  GRASSELLI*  A.  (ED)  (1969) 

BK  *AUTOMATIC  INTERPRETATION  AND  CLASSIFICATION  OF  IMAGES* 

S  ACAOEMIC  PRESS*  NEW  YORK. 

A  GRIFFITH,  A.  K.  (1971) 

IN  *MATHEMATICAL  MODELS  FOR  AUTOMATIC  LINE  DETECTION* 

S  PIJCAI  2*  LONOON,  PAGES  17-26. 

A  GRIFFITH,  A.  K.  (1973) 

JP  *MATHEMATICAL  MODELS  FOR  AUTOMATIC  LINE  DETECTION* 

S  J ACM,  VOL  20,  PAGES  62-80. 

A  HORN,  B.K.P.  (197*) 

JP  *DETERMINING  LIGHTNESS  FROM  AN  IMAGE* 

S  COMPUTER  GRAPHICS  ANO  IMAGE  PROCESSING,  VOL  3,  PAGES  277-299. 

A  HUECKEL,  M.H.  (1971) 

JP  *AN  OPERATOR  WKICH  LOCATES  EDGES  IN  DIGITIZED  PICTURES* 

S  JACM,  VOL  18,  PAGES  113-125. 

A  HUECKEL,  M.H.  (1973) 

JP  * A  LOCAL  VISUAL  OPERATOR  WHICH  RECOGNIZES  EDGES  AND  LINES* 

S  JACM,  VOL  20,  PAGES  634-6*7, 

A  KANEFF,  S.  (ED)  (1970) 

BK  *P IC TUR E  LANGUAGE  MACHINES* 

S  ACADEMIC  PRESS,  NEW  YORK. 

A  LEVINE,  M.D.,  Q*HANDLEY,  D.A.  AND  YAGI,  G.M.  (1973) 

JP  *COMPUTER  DETERMINATION  OF  DEPTH  MAPS* 

S  COMPUTER  GRAPHICS  AND  IMAGE  PROCESSING,  VOL  2,  PAGES  131-150. 
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A  LIPKXN#  I.S«  ANO  ROSENFELD#  A.  (EOS)  (1970) 

BK  ^PICTURE  PROCESSING  ANO  PSYCHQPICTORICS# 

S  ACADEMIC  PRESS#  NEW  YORK. 

A  ORGORHAN  P.  AND  CLOWES#  M.B.  (1973) 

IN  RPINOING  EDGES  THROUGH  CQlllNEARITY  OF  FEATURE  POINTS#* 

S  PIJCAI  3#  STANFQRO#  PAGES  543-555. 

A  MILLER#  W.  ANO  SHAW#  A.  (1968) 

IN  ^LINGUISTIC  METHODS  IN  PICTURE  PROCESSING  -  A  SURVEY## 

S  AFIPS  PRQC  FALL  JOINT  COMP  CQNF#  VOL  33#  PAGE  279. 

A  PINGLE#  K.K.  ANO  TENENBAUM#  J.M.  (1971) 

IN  PAN  ACCOMODATING  EDGE  FOLLOWER# 

S  PIJCAI  2#  LONOON#  PAGES  1-7. 

A  GUAM#  L.H.  (1971) 

01  ^COMPUTER  COMPARISON  OF  PICTURES# 

S  STANFORO  UN IV#  COMPUTER  SCI  DEPT#  PH  D#  AIM-144. 

A  RAMER#  U.E.  (1975) 

JP  ^EXTRACTION  OF  LINE  STRUCTURES  FROM  PHOTOGRAPHS  OF  CURVED  OBJECTS# 
S  COMPUTER  GRAPHICS  AND  IMAGE  PROCESSING# VOL  4#  PAGES  81-103. 

A  REEO#  S.K.  (1973) 

BK  ^PSYCHOLOGICAL  PROCESSES  IN  PATTERN  RECOGNITION# 

S  ACAOEMIC  PRESS#  NEW  YORK. 

A  ROSENFELO#  A.  (1969) 

BK  PPICTURE  PROCESSING  BY  COMPUTER# 

S  ACADEMIC  PRESS#  NEW  YORK. 

A  ROSENFELO#  A.  (1969) 

JP  PPICTURE  PROCESSING  BY  COMPUTER# 

S  ACM  COMPUTING  SURVEYS.  VOL  1#  PAGES  147-176. 

A  ROSENFELO#  A.  (1972) 

JP  PPICTURE  PROCESSING*  1972# 

S  COMPUTER  GRAPHICS  ANO  IMAGE  PROCESSING#  VOL  1#  PAGES  394-41&. 

A  ROSENFELO#  A.  (1973) 

JP  PPRQGRESS  IN  PICTURE  PROCESSING#  1969-1971# 

S  ACM  COMPUTING  SURVEYS#  VOL  5#  PAGES  81-108. 

A  ROSENFELO#  A.  (1974) 
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APPENDIX  IV.2 

TWO  ALGORITHMS  FOR  CONSTRUCTING  A 
DELAUNAY  TRIANGULATION 


This  appendix  presents  a  comprehensive  description  of  methods  used  to  create  a  type 
of  Voronoi  tessellation  known  as  Delaunay  triangulation.  This  presentation  unifies 
material  from  a  variety  of  sources.  The  algorithms  which  are  presented  here  provide 
for  efficiency  in  representing,  storing,  and  displaying  ground  surface  coloration  and 
texture. 
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1.  Introduction 


In  this  paper  we  consider  the  problem  of  triangulating 
a  set  of  points  in  the  plane.  Let  V  be  a  set  of  N»3  distinct 
points  in  the  Euclidean  plane.  We  assume  that  these  points  are 
not  all  col  inear.  Let  E  be  the  set  of  straight  line  segments 
(edges)  between  vertices  in  V.  Two  edges  elte2eE,  e^ej, 
be  said  to  properly  intersect  if  they  intersect  at  a  point 
other  than  their  endpoints.  A  triangulation  of  V  is  a  planar 
straight-line  graph  G(V,E')  for  which  E'  is  a  maximal  subset 
of  E  such  that  no  two  edges  of  E'  properly  intersect  [16]. 

There  is  no  conceptual  difficulty  involved  in  constructing 
a  triangulation.  Any  set  of  points  can  be  triangulated  if 
edges  are  added  with  the  proviso  that  no  new  edge  intersects 
an  existing  edge.  We  will  investigate  a  particular  triangulation 
called  the  Delaunay  triangulation  [  3  ].  It  has  the  property  that 
the  circumcircle  of  any  triangle  in  the  triangulation  contains 
no  point  of  V  ir.  its  interior. 

This  paper  is  the  result  of  a  recent  study  whose  objective 
was  to  develop  an  efficient  algorithm  for  fitting  triangular 
faceted  surfaces  to  digital  terrain  data.  A  piecewise  planar 
surface  is  used  as  a  terrain  model  by  all  visual  flight  simulators. 
It  was  concluded  that  the  Delaunay  triangulation  is  an  excellant 
choice  for  this  application,  based  on  the  initial  objectives  of 
minimizing  computation  time  and  producing  a  good  visual  display. 
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In  the  next  section,  we  will  formally  define  the  Delaunay 
triangulation  and  review  its  properties.  Then  in  Section  3,  we 
will  provide  two  algorithms  for  its  construction.  Section  4 
will  cover  some  applications  of  the  triangulation. 

2.  Definition  and  Properties  of  the  Delaunay  Triangulation 

Suppose  that  we  are  given  a  set  V»{  ,. . .  ,vN>,  N&3, 

of  points  in  the  Euclidean  plane.  Assume  that  these  points  are 
not  all  colinear,  and  that  no  four  points  are  co-circular.  Let 
d(v.,v.)  denote  the  Euclidean  distance  between  points  v.  and  v.. 

I  J  *  J 

The  bisector  8(v.,v.)  of  vertices  v.  and  v.  is  the  locus  of 

I  J  '  \J 

points  equidistant  from  them;  i.e.  B(v. s v  .  )={ xeE2 1 d(x,v . )=d(x,v  . ) } . 

'  J  *  J 

The  half-plane  H(vj,Vj)  is  the  locus  of  points  closer  to  vi  than 

to  any  v.,  jj»1 ;  H(  v .  ,v  . )«{  xeE2  |d(x,v  .  )«d(x,v , ) ;  jj»1}.  The 
J  1  J  1  J 

polygonal  region  VD( vi  )=^,.H(vi  )  is  the  locus  of  points  closer 

to  vertex  v.  than  to  any  other  vertex.  Each  vertex  vi  therefore 
defines  a  region  VD(v-)  called  the  Voronoi  [37]  (Dirichlet, 
Wigner-Seitz  ,  Theissen  [36],  or  "S"  [24])  polygon  associated 
with  the  vertex.  The  collection  of  these  N  Voronoi  polygons  is 
referred  to  as  the  Voronoi  diagram  VD(V)  of  the  set  V  of  points 
[28,32], 

Voronoi  polygons  may  be  thought  of  as  the  cells  of  a  growth 
process.  Suppose  that  we  let  each  vertex  in  V  be  the  nucleus 
of  a  growing  cell.  Cells  will  propagate  outward  from  their  nuclei, 
simultaneously  and  at  a  uniform  rate.  The  border  of  a  growing 
cell  will  freeze  in  place  at  its  points  of  contact  with  the  border 
of  another  growing  cell. 
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Eventually,  only  the  cells  whose  nuclei  are  on  the  convex  hull 
of  V  are  still  expanding.  The  remaining  cells  have  completely 
partitioned  (or  tessellated)  a  region  of  the  plane  into  a  set 
of  non-overlapping  closed  convex  polygons,  one  polygon  about 
each  nucleus.  These  closed  polygons,  together  with  the  open  polygons 
on  the  convex  hull  define  a  Voronoi  tessellation  of  the  entire  plane. 

Let  us  take  a  closer  look  at  this  process.  Since  all  cells 
expand  at  the  same  rate,  the  first  point  of  contact  between  two  cells 
must  occur  at  the  midpoint  between  their  nuclei.  Likewise,  every 
point  of  continuing  contact  must  be  equidistant  from  the  two  nuclei. 
These  points  are  on  the  common  edge  (called  a  Voronoi  edge)  of  two 
developing  Voronoi  polygons.  This  edge  continues  elongating  until  it 
encounters  the  border  of  a  third  expanding  cell.  The  point  of  contact 
(called  a  Voronoi  point)  of  this  edge  and  the  border  of  the  tnird  cell 
must  be  equidistant  from  the  growth  centers  of  all  three  cells.  It  is 
therefore  the  ci rcumcen^er  of  the  triangle  defined  by  the  three  nuclei. 

Voronoi  cells  which  share  a  common  edge  are  called  Voronoi 
neighbors.  The  aggregate  of  triangles  formed  by  connecting  the 
growth  centers  of  all  Voronoi  neighbors  tessellates  the  area  within 
the  convex  hull  of  the  point  set.  This  tessellation  is  called  the 
Delaunay  tri angulation  DT(V)  of  V.  An  example  of  a  Voronoi  tessellation 
and  its  dual  is  shown  in  Figure  1. 

Each  Voronoi  point  corresponds  to  a  triangle  and  each  Voronoi 
edge  to  a  Delaunay  edge.  S' nee  the  number  of  Voronoi  points  and 
edges  are  both  0 ( N ) 1 ,  the  number  of  Delaunay  triangles  and  edges 
are  0(N).  To  be  more  precise  we  have  the  following. 

1.  Note:  We  say  that  g(n)=0(f(n))  if  !g(n)  |-;cf(n)  for  some  constant  c 
and  all  sufficiently  large  n.  We  say  that  g(n)=c(f(n))  if  (g(n);^cf(n) 
for  some  constant  c>0  arid  all  sufficiently  large  n.  We  say  that 
g(n)=9(f(n))  if  both  g(n)=0(f(n))  and  g(n)=fi(f (n) ) . 
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Lemma  1  Given  a  set  V  of  N  points,  any  triangulation  T(V) 
has  the  same  number  of  triangles,  Nt=2(N-l )-N^ ,  and  the  same 
number  of  edges,  Ng=3(N-l  )-N^,  where  is  the  number  of  points 
on  the  convex  hull  of  V. 

Proof  By  induction,  see  [11]  for  example. 

Now  we  will  state  without  proof  some  properties  of  the 
Delaunay  triangulation  DT(V). 

Lemma  2  Given  a  set  V={  v-j ,. . .  ,v^}  of  points,  any  edge  (v.,v. )  is  a 
Delaunay  edge  of  DT ( V )  if  and  only  if  there  exists  a  point  x  such 
that  the  circle  centered  at  x  and  passing  through  vi  and  v.  does 
not  cortain  in  its  interior  any  other  point  of  V. 

Corollary  Given  a  set  V={  v-j , . . .  ,vN }  of  points,  the  edge  (v^v^) 
on  the  boundary  of  the  convex  hull  of  V  is  a  Delaunay  edge. 

Lemma  3  Given  a  set  V*( v^ , . . . ,v^}  of  points,  av.v.v^  is  a  Delaunay 
triangle  of  DT(V)  if  and  only  if  its  circumcircle  does  not  contain 
any  other  point  of  V  in  its  interior. 

The  proofs  of  these  lemmas  can  be  found  in  [13,  32].  The  latter 
property  is  called  the  circle  criterion.  It  is  often  used  as  a 
rule  for  constructing  a  triangulation.  Triangul ation^  may  also  be 
constructed  according  to  the  MAX-MIN  angle  criterion,  i.e.  the  minimum 
r..aasure  of  angles  of  all  the  triangles  in  the  triangul ation  is 
maximized.  We  shall  investigate  the  relationship  between  these  two 
criteria. 
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The  following  analysis  follows  that  given  by  Lawson  in  [9]  and 
[10]  (see  also  Sibson  [33]).  Consider  a  very  simple  triangulation, 
that  over  the  vertices  of  a  strictly  convex  quadrilateral.  A 
quadrilateral  is  called  strictly  convex  if  its  four  interior  angles 
are  each  less  than  180°,  A  quadrilateral  can  be  partitioned  into 
two  triangles  in  two  possible  ways.  Each  of  the  criteria  described 
above  can  be  chought  of  as  a  rule  for  choosing  a  preferred  triangulation. 

By  examining  the  case  of  four  co-circular  points  (Fig.  2  ),  one 
can  show  that  the  two  criteria  are  equivalent.  Suppose  that  for  this 
example  line  segment  (vg.Vj)  is  shorter  than  (v^.v^),  (v^.v-j)  and 
(V1 *v2) *  Let  the  an9u1ar  measure  of  the  arc  v^.v^  be  29.  Then  the 
angles  <v.j,v.|,v.,  and  are  each  Ttius  the  two  P0S3lble 

triangulations  over  the  four  points  have  the  same  minimum  angle.  The 
choice  of  a  preferred  triangulation  is  then  arbitrary  according  to 
the  MAX -MIN  angle  criterion.  The  Voronoi  tessellation  of  the 
quadrilateral  also  exhibits  a  tie  case.  All  four  Voronoi  polygons 
meet  at  a  single  point. 

Further  analysis  shows  that  moving  one  DOint,  say  point  v^  in 
Fig.  2  ,  inside  the  circle  causes  cv^.v,.^  to  increase  and  points 
'! 2  and  v^  to  be  the  growth  centers  of  Voronoi  neighbors.  Consequently, 
the  two  criteria  and  the  Voronoi  tessellation  of  the  polygon  all  now 
prescribe  the  connection  of  vertices  v^  and  v^. 

A  fourth  criteria  has  been  studied,  that  of  choosing  the  minimum 
length  diagonal,  Shamos  and  Hoey  [32]  claim  that  a  Delaunay 
tr-;angulation  is  a  minimum  edge  length  triangulation.  Lawson  [9  ] 
and  Lloyd  [16]  prove  by  counterexample  that  this  is  not  the  case 
(Fig.  3). 


Lawson  [10]  gives  the  following  local  optimization  procedure  (LOP) 
for  constructing  a  triangulation.  Let  e  be  an  internal  edge  (in 
contrast  to  an  edge  on  the  convex  hull)  of  a  triangulation  and  Q  be 
the  quadrilateral  formed  by  the  two  triangles  having  e  as  their  common 
edge.  Consider  the  circumcircle  of  one  of  the  triangles.  This  circle 
passes  through  three  vertices  of  Q.  If  the  fourth  vertex  of  the 
quadrilateral  is  within  the  circle,  replace  e  by  the  other  diagonal 
of  Q,  otherwise  no  action  is  taken.  An  edge  of  the  triangulation  is 
said  to  be  locally  optimal  if  an  application  of  the  LOP  would  not 
swap  it.  Since  for  any  set  of  vertices,  the  number  of  triangles  in 
any  triangulation  is  a  constant,  a  linear  ordering  over  the  set  of  all 
triangles  can  be  defined  as  follows.  To  each  triangle  in  the 
triangulation  we  assign  a  value,  which  is  the  measure  of  its  minimum 
angle.  Let  denote  the  number  of  triangles.  For  each  triangulation 
we  have  an  indicator  vector  of  components,  each  component  corresponding 
to  the  minimum  angle  of  its  corresponding  triangle.  Triangles 
are  sorted  in  nondecreasing  order.  Given  any  two  triangulations 
T  and  T',  define  T<T ’  if  and  only  if  the  associated  indicator  vector  of 
T  is  lexicographically  less  than  that  of  T'. 

Theorem  1  [10,13]  Given  a  triangulation  T,  if  an  application  of  the  LOP 
to  an  edge  e  results  in  a  swapping  of  the  edge  with  any  other  edge  e', 
thus  producing  a  new  triangulation  T 1 ,  then  T < T ' ;  i.e.  T'  strictly 
follows  T  in  the  linear  ordering  defined  above. 

Proof  Let  I  be  an  indicator  vector  for  T.  The  measures  of  the  smallest 
angles  in  the  two  triangles  of  T  sharing  the  edge  e  occur  as  two  of  the 
components  of  I,  say  I ^  and  1^  with  j<k  and  thus  Since  a  swap 
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was  made  when  applying  the  LOP,  the  smallest  angle  in  both  of  the 
two  new  triangles  of  T'  sharing  the  edge  e'  must  be  strictly  greater 
than  I..  It  follows  that  the  indicator  vector  I  for  T  is 
lexicographically  less  than  the  indicator  I’  for  T1  and  thus  T<T'. 

Theorem  2  [10,13]A11  internal  edges  of  a  triangulation  T  of  a  finite 
set  V  are  locally  optimal  if  and  only  if  no  point  of  V  is  interior 
to  any  circumcircle  of  a  triangle  of  T. 

Proof  If  no  point  of  V  is  interior  to  the  circumcircle  of  any 
triangle  of  T,  then  the  application  of  LOP  to  any  edge  will  not 
swap  it.  Thus  all  edges  are  locally  optimal.  If  all  edges  are 
locally  optimal,  then  we  show  that  no  point  of  V  is  interior  to 
the  circumcircle  of  any  triangle.  Suppose  tnat  the  circumcircle 
K  of  a  triangle  aabc  contains  a  point  v  of  V.  Let  5  be  the  distance 
of  v  to  its  nearest  edge,  say  (a,c)  as  shown  in  Fig.  4.  Assume 
that  among  all  triangles  of  T  whose  circumcircles  contain  v  as  an 
interior  point,  none  has  an  edge  which  is  at  a  distance  less  than  5 
from  v.  Since  v  is  on  the  opposite  side  of  (a,c)  from  b,  the  edge 
(a,c)  is  not  a  boundary  edge  of  T.  Thus, there  is  another  triangle 
aacq  sharing  an  edge  with  triangle  aabc.  The  vertex  q  cannot  be 
interior  to  the  circle  K  as  this  would  contradict  the  hypothesis 
that  edge  (a,c)  is  locally  optimal.  The  vertex  q  cannot  be  in 
the  cross-hatched  region  of  the  diagram,  or  aacq  will  contain  v  in 
its  interior.  Suppose  that  edge  (c,q)  is  the  nearest  edge  of  aacq 
to  v.  Note  that  the  distance  from  (c,q)  to  v  is  less  than  5.  Since 
the  circumcircle  of  aacq  also  contains  v  in  its  interior,  we  have  a 
contradiction  that  aabc  is  the  triangle  with  an  edge  at  the  smallest 
distance  from  v. 
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By  the  above  theorem,  the  edges  of  the  Delaunay  tri angulation 


of  a  finite  set  V  of  points  are  locally  optimal.  If  we  assume 
that  no  more  than  three  points  are  co-circular,  the  uniqueness  of 
the  Delaunay  triangulation  [33]  suggests  the  following  theorem. 

Theorem  3  A  triangulation  T(V)  is  a  Delaunay  triangulation  if 
and  only  if  its  indicator  vector  is  lexicographically  maximum,  i.e. 
no  triangulation  follows  it  in  the  linear  ordering. 

Proof  If  the  triangulation  T  is  lexicographically  maximum,  then 

all  the  edges  of  T  must  be  locally  optimal,  which  implies  that  no 

circumcircle  of  any  triangle  will  contain  any  other  point  of  V  in 

its  interior  (Theorem  2).  Thus,  T  is  the  Delaunay  tri angulation 

DT(V).  To  prove  the  converse,  suppose  that  the  Delaunay  triangulaticn 

is  not  maximum  in  the  linear  ordering.  That  is,  there  exists  another 

triangulation  T(V),  such  that  DT ( V ) <T ( V ) .  Repeatedly  applying 

the  LOP  to  T(V)  will  produce  a  triangulation  T'(V)  in  which  all 

the  edges  are  locally  optimal.  Since  DT(V)<T(V)<T’ (V) ,  T'(V)  would 

also  be  a  Delaunay  triangulation  by  Theorem  2  and  Lemma  3.  However,  since  tne 

Delaunay  triangulation  is  unique,  T'(V)=DT(V),  a  contradiction. 

Corollary  1  The  Delaunay  triangulation  of  a  set  of  points  satisfies 
the  MAX -MI N  angle  criterion.  (Note  that  a  triangul ation  whicn 
satisfies  the  MAX-MIN  angle  criterion  is  not  necessarily  a  Delaunay 
triangulation. )  " 

We  have  shown  that  the  Delaunay  triangulation  of  a  set  of  points 
is  a  maximum  in  the  linear  ordering  over  the  set  of  possible  triangulations. 
Now  we  are  ready  to  describe  two  algorithms  for  its  construction. 


2.  Consider  a  triangulation  T  whose  indicator  vector  is  (i,,...,i^) 
where  i,  is  the  minimum  angle.  Suppose  that  aabc  is  the  triangle  1 
with  the  smallest  angle.  If  applying  LOP  to  the  edges  of  ab.bc.ca  will 
not  result  in  a  swap,  then  T  satisfies  the  MAX-MIN  angle  criterion. 
Since  we  have  only  checked  one  triangle  in  the  triangulation,  we  cannot 
say  that  T  is  a  Oelaunay  triangul ation .  However,  if  we  apply  the  LOP 
to  all  edges  of  T,  until  no  more  swapping  occurs,  the  resulting 
triangulation  will  be  Delaunay. 
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3.  Techniques  for  Constructing  the  Delaunay  Triangulation 


We  will  present  two  algorithms  for  constructing  a  Delaunay 
triangulation.  The  first  is  rather  involved,  but  its  running  time 
is  asymptotically  optimal.  The  second  algorithm  is  simpler  to 
understand,  simpler  to  program,  and  requires  less  overhead.  These 
features  make  it  especially  attractive  for  small  and  medium  size 
data  sets  (=500  points  or  less). 

The  first  algorithm  is  comparable  to  the  one  given  by  Lewis  and 
Robinson  [15]  in  that  it  divides  the  original  data  set  into  disjoint 
subsets.  After  obtaining  a  solution  for  each  of  these  subsets,  it 
combines  solutions  to  yield  the  final  result.  In  [15],  Lewis  and 
Robinson  claim,  wi thout  proof,  that  their  algorithm  runs  in  O(NlogN). 

But  in  fact3,  the  running  time  of  their  algorithm  is  O(N^).  The 
second  algorithm  that  we  will  present  is  iterative.  It  follows 
the  idea  proposed  by  Lawson  [9].  Nelson  [23]  gives  a  similar  method. 

3,1  Divide  -  and  -  Conquer 

Shamos  and  Hoey  [32]  present  an  e(NlogN)  algorithm  for  constructing 

the  Voronoi  diagram  for  a  set  of  N  points  in  the  plane.  Green  and 

Sibson  [7]  also  implement  an  algorithm  for  this  purpose,  but  the 
2 

running  time  is  0(N  )  in  the  worst  case.  Lee  [14]  modifies  the 
procedure  given  by  Shamos  and  Hoey  and  extends  the  method  to  any 
Lp  metric,  l<p$=-. 

Once  the  Voronoi  diagram  is  obtained,  its  dual  graph  (i.e.  the 
Delaunay  triangulation)  can  be  constructed  in  an  additional  C(N)  time. 

To  eliminate  the  need  for  a  two  step  procedure,  we  have  developed 


3.  There  are  at  least  four  triangulation  algorithms  in  the  computer 
literature  which  claim  to  be  O(NlogN),  but  which  are  in  fact  0(n2). 

A  counter-example  for  several  of  these  is  given  in  the  Append!/. 
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the  following  algorithm  which  constructs  a  triangulation  directly. 

The  running  time  of  the  algorithm  is  shown  to  be  O(NlogN).  Shamos 

and  Hoey  [32]  show  that  the  construction  of  any  triangulation  over  N 

points  requires  fl(NlogN)  time.  Thus,  our  algorithm  is  optimal. 

We  will  begin  by  describing  the  data  structures  and  notation 

to  be  used  in  the  sequel.  For  each  point  v^. ,  we  keep  an  ordered 

adjacency  list  of  points  where  (v^v^),  j»1,...k,  is 

a  Delaunay  edge.  The  list  is  doubly-linked  and  circular.  PRED(v.,v. .) 

denotes  the  point  v.p  which  appears  clockwise  (CW)  of  and  immediately 

after  the  point  v...  The  counter-clockwise  function  SUCC  operates 
^  J 

in  a  similar  manner.  For  example  in  Fig.  5  ,  v^=PRED( v-|  ,Vg )  and 

Vg=$UCC( Vi ,v^} . 

If  the  point  v.  is  on  the  convex  hull  CH(V),  then  the  first 

entry  on  its  adjacency  list  is  the  point  denoted  FIRST(v.),  which 

appears  after  v.  if  we  traverse  the  boundary  of  CH(V)  in  a  CCW 

direction.  Let  t(v^,v^)  denote  the  line  segment  directed  from  v.. 

to  v  . . 

0 


Now  we  are  ready  to  construct  the  Delaunay  triangulation. 

First,  we  sort  the  given  set  V  of  N  points  in  lexicographically 
ascending  order  and  rename  the  indices  so  that  v.j <v^<  . . . < v^ ,  such 
that  (x,,y.)=v.<v.  if  and  only  if  either  x.<x.  or  x.=x.  and  y.<y.. 

I  i  V  I  J  J  '  w 

Next  we  divide  V  into  two  subsets  and  VRJ  such  that  V ^ = { V -j  , . . .  »v^/2j ' 

and  VR»{ v  N/2j+1 . vN).  Now  we  recursively  construct  the  Delaunay 

triangulations  DT(V^)  and  DT(VR).  To  merge  DT(V^)  and  DT(VR)  to  form 


DT(V^UVR),  we  make  use  of  the  convex  hull  CH(V^UVR).  The  convex  hull 
can  be  obtained  in  0 ( N )  time  [  26  ]  from  the  union  of  the  convex  hulls 
CH(Vl)  and  CH(VR).  The  convex  hulls  can  also  be  computed  recursively. 
Forming  the  union  of  CH(V^)  and  CH(VR)  will  result  in 
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two  new  hull  edges  which  are  the  lower  and  upper  common  tangents  of 
CH(Vl)  and  CH(VR).  These  two  corrmon  tangents  are  known  to  be  in  the 
final  Delaunay  triangulation. 

The  following  subroutine  finds  the  lower  common  tangent  of  CH(VL) 
and  CH(Vr).  For  each  convex  hull  CH(S),  we  maintain  two  points 
IM($)  and  RM(S),  which  are  the  leftmost  and  rightmost  points  of  S, 
respectively. 

Subroutine  HULL 

( Comment :  HULL  is  input  two  convex  hulls.  It  finds  their  lower 
common  tangent  (Fig.  6).  The  upper  common  tangent  can 
be  found  in  an  analagous  manner.) 

(Comment:  i(X,Y)  denotes  the  line  directed  from  X  to  Y.) 

X-RM(Vl)  ;  Y-*-LM(Vr) 

Z-*-FIRST(Y) ;  Z * -F I RST ( X ) ;  Z"-PRED(X,Z'  ) 

A:  IF  (Z  is-right-cf  t(X, Y)) 

Z-*-SUCC(Z,Y) 

Y-Z 

ELSE 

IF  (Z"  is-right-of  z(X,Y) 

Z"-PRED(Z",X) 

X^Z" 

ELSE 

RETURN  (X,Y) 

ENDIF 

ENDIF 
GO  TO  A 
END  HULL 
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The  lower  common  tangent  will  be  used  as  an  input  to  the  following 
subroutine  which  merges  the  two  triangulations  DT(V^)  and  DT(VR). 

Subroutine  MERGE 

(Comment:  MERGE  is  input  two  tri angulations  and  the  upper  and  lower 
common  tangents  of  their  convex  hulls.  It  merges  the  two  triangulations, 
starting  with  the  lower  common  tangent,  zig-zagging  upward  until  the 
upper  common  tangent  is  reached.  ) 

(Comment:  Initially,  the  points  adjacent  to  the  endpoints  of  the  lower 
common  tangent  are  examined.  Using  the  circle  criterion,  we  either 
connect: 

(i)  the  left  endpoint  (in  V^)  of  the  lower  common  tangent  to  a 
point  adjacent  to  the  right  endpoint  (in  VR)  of  the  lower 
common  tangent,  or 

(ii)  the  right  endpoint  (in  VR)  of  the  lower  common  tangent  to  a 
point  adjacent  to  the  left  endpoint  (in  V^)  of  the  lower 
common  tangent. 

The  above  process  is  repeated  with  the  newly  found  edge  taking  the  place 
of  the  lower  coiranon  tangent,  and  each  succeeding  new  edge  taking  the  place 
of  that.  The  subroutine  continues  in  this  manner  until  the  upper  common 
tangent  is  reached.  ) 

(Comment:  The  adjacency  list  of  the  right  (left)  endpoint  in  VR(VjJ  of 
the  current  edge  being  considered  is  examined  in  a  CW  (CCW)  direction 
starting  with  the  left  (right)  endpoint  of  the  edge.  ) 

(Comment:  INSERT ( A , B )  inserts'  point  A  into  the  adjacency  list  of  B  and 
point  B  into  the  adjacency  list  of  A  at  proper  positions.  OELETE(A.B) 
deletes  A  from  the  adjacency  list  of  B  and  B  from  the  adjacency  list 
of  A.  ) 


(Cormjent:  QTEST(H,I,J,K)  tests  the  quadrilateral  having  CCW  ordered 
vertices  It  returns  TRUE  if  the  circumc  rcle  of  iHIJ  does 
not  contain  K  in  its  interior,  and  returnsFALSE  otherwise.  ) 


step  1 


step  2 
step  3 
step  4 
step  5 
step  6 
step  7 
step  3 


step  9: 


INITIALIZATION 
8T-lower  common  tangent 
UT-upper  common  tangent 
L-left  endpoint  of  BT 
R-right  endpoint  of  BT 

00  UNTIL  (BT  equals  UT) 

A-8-FALSE 

INSERT ( L , R) 

R-j*-PRED(  R  ,L ) 

IF  (R]  is-left-of  i(L,R) ) 

r2-pred(r,r1) 

00  UNTIL  (QTEST ( R-,  ,L,R,R?)) 
DELETE  (R,R.)  1  4 

RrR2  1 

rUpreoir.r,) 

EN0  DO  UNTIL 

ELSE 
A  -TRUE 
ENDIF 


step 

10: 

Lj-SUCCU.R) 

step 

11: 

IF  ( L -j  is-right-of  i(R,L) 

step 

12: 

L2-SUCC(L,L1 ) 

step 

13: 

DO  UNTIL  (QTEST(L  , R , L ,L-) 
DELETE  (L.LJ  1  d 

L  -L 

lI-s§cc(l,l1) 

END  DO  UNTIL 

step 

14; 

ELSE 

B-TRUE 

ENDIF 

step 

15: 

IF  (A) 

L^1 

step 

16: 

ELSE 

step 

17: 

IF  (B) 

R-R1 

step 

18: 

ELSE 

step 

19: 

IF  iqTEST(L,R,R,,L,) 
R-R1  1  1 

step 

20: 

ELSE 
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ENDIF 


step  21 : 
step  22: 
step  23: 

step  24: 

To  show  that  the  above  algorithm  merges  two  triangulations 
correctly,  it  is  sufficient  to  show  that  each  edge  we  insert  into 
the  triangulation  is  a  Delaunay  edge  (step  4).  Initially,  the 
first  edge  (L,R)  is  a  lower  common  tangent  and  is  known  to  be  a 
Delaunay  edge.  Steps  5-8  delete  the  edges  in  DT(VR)  which  are 
not  Delaunay  edges  in  DT ( V )  by  determining  if  L  is  within  the 
circumcircle  of  aR,^,^.  If  so,  the  edge  (R,R^)  is  not  a  Delaunay 
edge  and  must  be  deleted.  Steps  10-13  perform  the  same  operation 
on  the  edges  in  DT(VL).  An  example  of  this  process  is  shown  in 
Fig.  7a.  Now  the  circumcircle  KR  of  al,R,Ri  does  not  contain 
any  point  of  VR  in  its  interior  and  the  circumcircle  of  aR,L,L^ 
does  not  contain  any  point  of  in  its  interior.  Now  as  shown 
in  Fig.  7b,  we  have  a  choice  of  either  connecting  to  R  or  R, 
to  L.  Step  19  chooses  the  correct  edge  by  applying  the  circle 
criterion.  In  Fig.  7  ,  contains  R^  in  its  interior,  so  we 
choose  the  edge  (L,R^).  All  that  we  have  left  to  do  is  to  show 
that  the  edge  (L,R-j)  is  a  Delaunay  edge,  or  equivalently  that  the 
circle  KR  does  not  contain  any  point  of  in  its  interior. 

Since  the  edge  (l,R)  is  known  to  be  a  Delaunay  edge,  by  Lemma  2  there 
exists  a  circle  K  passing  through  L  and  R  which  contains  no  point 
of  V  in  its  interior.  It  is  also  known  that  the  circle  contains 


ENDIF 

ENDIF 

B>i(L,R) 

END  DO  UNTIL 
END  MERGE 
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no  point  of  in  its  interior.  Since  the  circle  KR  lies  inside 
the  union  of  K  and  K^,  it  follows  that  KR  contains  no  point  of 
in  its  interior.  We  combine  this  result  with  the  fact  that 
KR  does  not  contain  any  point  of  VR  in  its  interior  to  conclude 
that  Kr  contains  no  point  of  V  in  its  interior.  Thus,  the  edge 
(L,R.|)  is  a  Delaunay  edge.  The  edge  ( L ,R-j )  can  now  be  inserted 
to  replace  the  edge  (L,R).  In  showing  that  the  next  edge  to  be  added  after 
(L.R-j)  is  a  Delaunay  edge,  the  circle  KR  plays  the  same  role  as 
the  circle  K  just  did. 

Not  let  us  analyze  the  algorithm  MERGE.  We  first  note  that 
during  the  merge  process,  once  an  edge  is  deleted,  it  will  never 
be  re-examined.  Since  the  total  number  of  edges  deleted  is  0(N) 
and  the  total  number  of  edges  added  is  also  0(N),  the  time  needed 
for  the  merge  is  Q(N),  Let  t(N)  denote  the  time  required  to 
construct  the  Delaunay  triangulation  for  a  set  of  N  points.  We 
have  the  following  recurrence  relation 

t(N)  =  2t(N/2)  +  M(N/2,N/2) 
t(l)  =  0, 

where  M  represents  the  time  required  for  the  merge  process.  Since 
M(N/2,N/2)*0(N) ,  t(N)=0(NlogN) . 
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3,2  Iteration 

The  algorithm  presented  in  this  section  iteratively  triangulates 
a  set  of  points  within  a  rectangular  region.  If  the  point  set  does 
not  include  all  four  vertices  of  the  rectangle,  the  missing  vertices 
are  implicitly  added.  The  algorithm  uses  the  swapping  approach 
developed  by  Lawson  in  [9]  and  [10].  Since  the  convex  hull  of  our 
point  set  is  a  rectangle  and  is  known  in  advance,  we  need  not 
compute  it,  as  is  done  in  Lawson's  algorithm.  The  initial  ordering 
of  the  point  set  also  differs  from  that  used  by  Lawson. 

This  algorithm  was  developed  with  the  terrain  fitting  problem  in 
mind.  Terrain  regions  are  processed  in  rectangular  blocks.  Adjacent 
terrain  regions  must  fit  together  without  any  gaps.  Once  a  triangular 
faceted  surface  is  fit  to  a  terrain  region,  the  accuracy  of  the  fit  is 
usually  computed.  If  the  approximation  surface  does  not  meet  the  given 
accuracy  constraints,  additional  vertices  are  added  and  the  triangulation 
is  updated.  An  iterative  triangulation  algorithm  is  ideal  for  updating. 

Algorithm  SWAP 

INITIALIZATION 

step  1:  Given  a  set  V  of  N  points  within  a  rectangle,  remove  any  points 
which  fall  on  the  vertices  of  the  rectangle. 

1/2 

step  2:  Partition  the  rectangle  into  approximately  N  '  bins  (Smaller 
rectangular  regions). 

step  3:  Re-order  the  points  by  bins,  starting  at  some  bin  and  proceeding 
to  neighboring  bins  (see  Fig.  8  ). 

step  4:  Place  the  first  point  into  the  rectangle.  Connect  the  point  to 
the  four  corners  of  the  rectangle  to  produce  an  initial  triangulation. 
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ITERATION 


(Comment :  The  remaining  points  in  V  will  be  iteratively  added  to  the 
triangulation.  After  each  point  is  added,  it  will  be  connected  to 
the  vertices  of  its  enclosing  triangle.  The  triangulation  will  tnen 
be  re-structured  so  that  the  MAX-MIN  angle  criterion  is  globally 
satisfied.  [See  Fig.  9.]  ) 

step  1:  Input  the  next  point  to  the  existing  triangulation.  Connect 
this  point  to  the  vertices  of  its  enclosing  triangle, 

step  2:  Step  1  will  produce  up  to  four  strictly  convex  quadrilaterals. 
(Four  quadrilaterals  only  occur  when  a  newly  introduced  point  falls 
on  the  edge  of  the  triangulation.)  Each  of  these  quadrilaterals  has 
an  alternate  diagonal.  Swap  a  diagonal  with  its  alternate,  if  doing 
so  is  required  to  satisfy  the  MAX-MIN  angle  criterion  within  the 
quadrilateral  (i.e.  use  the  LOP  within  the  quadrilateral). 

step  3:  Each  swap  performed  in  step  2  may  result  in  two  new  quadrilaterals 
that  need  to  be  tested.  If  one  of  these  quadrilaterals  doesn't  satisfy 
the  MAX-MIN  angle  criterion,  swap  its  diagonal  with  its  alternate. 

step  4:  This  swapping  procedure  may  propagate  further  outward.  Lawson 
[10]  has  shown  that  this  process  will  always  terminate. 

step  5:  If  all  points  in  V  have  been  used  then  stop,  otherwise  go  to 
step  1 . 

END  SWAP 

Step  1  requires  the  identification  of  the  enclosing  triangle  of 
a  point.  This  can  be  accomplished  by  tt®  following  very  simple 
subroutine  [  10].  The  subroutine  assumes  that  the  triangulation  is 
stored  using  a  variation  of  Lawson's  data  structure  (given  in  the 
Appendix) . 
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Subroutine  TRIFIND 

(Comment:  The  tri angul ati on  is  stored  in  the  form  given  in 
Appendix  2. This  subroutine  locates  the  triangle  r  which  encloses  the 
point  (x0,yQ).  ) 

(Comment:  X(t,i)  denotes  the  x  value  of  the  i-th  vertex  of  triangle  t.  ) 

T+'last  triangle  c.eated 
LOOP:  DO  FOR  I~1  to  3 
I  PLUS  W  (mod  3)+l 

IF  C(y0-Y(T,I))*(X(TtIPLUSl)-x0)  .ST.  (xq-X(t ,1 ) )*( Y(r , IPLUS1 )-yQ)] 

Comment:  If  ( xQ ,yQ )  is  not  in  t,  jump  to  the  neighbor  of  t  which 
is  in  the  direction  of  the  point. 

t^N(t.IPLUSI) 

GO  TO  LOOP 

ENDIF 

END  DO  FOR 

RETURN  (t) 

END  TRIFIND 


Each  iteration  of  algorithm  SWAP  requires  an  0(N)  searcn  performed 
by  TRIFIND,  followed  by  an  0(N)  swapping  procedure.  Thus,  the  algorithm 
is  0(N2). 

An  empirical  examination  of  algorithm  SWAP  yields  somewhat  better 
results.  If  the  initial  point  set  is  uniformly  distributed  within  its 
enclosing  rectangle,  then  the  number  of  data  points  in  each  bin  will  be 
approximately  CKN1*).  Thus  the  search  procedure  will  be  0(Nls).  The 
swapping  procedure  which  updates  the  triangulation  can  be  propagated 
many  times.  We  have  found  that  two  levels  of  swaos  are  nearly  always 

sufficient  to  globally  satisfy  the  MAX-MIN  angle  criterion.  Thus  the 

21? 

algorithm  is  roughly  0(N  '),  empirically. 

IV. 2-25 


4,  Examples  and  Applications 


(1)  Random  Delaunay  Tri angulation 

A  two-dimensional  Poisson  process  of  intensity  A  can  be  used 
to  describe  a  random  distribution  of  points  in  the  plane.  This  process 
is  characterized  by  the  property  that  the  expected  number  of  points 
within  a  region  of  area  A  is  aA,  irrespective  of  the  shape  or 
orientation  of  the  region. 

Suppose  we  let  the  points  chosen  by  a  Poisson  process  seed  a 
Delaunay  triangulation.  The  resulting  network  of  triangles  inherits 
the  properties  of  homogeneity  and  isotropy  from  the  driving  point 
process. 

A  random  Delaunay  triangulation  is  probably  the  only  non-regular 
triangulation  which  is  mathematically  tractable.  Many  of  its 
statistical  properties  have  been  derived  by  Miles  [20].  its  principal 
first  order  statistics  are  given  below,  the  paper  by  Mil^s  also  provides 
the  associated  second  order  statistics. 

E[cell  area]  *  n/\ 

E[cell  circumference]  *  32/[3ir(A/2»)is] 

E[call  in-radius]  *  (8a/tt ) 

Miles  has  also  derived  the  probability  density  function  f(o)  for  an 
arbitrary  angle  a  in  the  triangulation. 

f(a)  *  4(  (ff-a)cosa+sina}'S---° 

a 

For  certain  applications,  we  would  like  a  triangulation  with  as 
few  small  angles  as  possible.  The  distribution  f(a)  provides,  a 
characterization  of  the  "goodness"  of  a  triangulation. 
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(2)  Interpolating  Functions  of  Two  Variables 


A  major  application  of  triangulations  is  the  interpolation  of 
functions  of  two  variables,  where  the  function  is  initially  defined 
only  at  an  irregular  set  of  locations.  These  locations  are  used 
as  the  vertices  of  a  triangulation.  The  value  of  the  function  at  a 
point,  other  than  a  vertex,  is  computed  by  performing  an  interpolation 
within  the  triangle  containing  the  point.  A  triangulation  composed 
of  nearly  equiangular  triangles  is  considered  good  for  this  purpose. 
McLain  [19]  and  Lawson  [10]  have  used  the  Delaunay  triangilation 
for  this  purpose.  Also  see  Powell  and  Sabin  [25]. 

(3)  Decomposition  of  Polygons  into  Convex  Sets 

in  algorithm  for  decomposing  polygons  into  triangles  may  be 
based  upon  the  concept  of  the  Delaunay  triangulation.  There  are 
applications  in  pattern  recognition  and  computer  graphics  for  which 
one  wants  to  combine  adjacent  triangles  into  larger  convex  sets. 

An  0(mN)  algorithm  for  decomposing  a  non-convex  polygon  of  N  sides 
and  m  reflex  angles  into  convex  sets  is  given  by  Schachter  [29]. 

(An  O(NlogN)  polygon  triangulation  algorithm  not  based  upon  the 
Delaunay  triangulation  is  given  by  Garey  et  al.  [5].) 

(4)  Terrain  Fitting 

A  rectangular  terrain  region  may  be  represented  by  an  array  of 
elevation  values.  A  two-dimensional  digital  filter  (e.g,  a  Wiener 
filter)  can  be  applied  to  this  data  to  extract  local  extrema  (i.e. 
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peaks  of  mountains  anci  "pits"  of  valleys)  and  ridge  line  segments. 

He  would  like  the  local  extrema  to  be  the  vertices  of  a  triangulation 
and  the  ridge  line  segments  to  fall  on  the  edges  of  the  tri angulation. 
This  structure  can  be  obtained  as  follows.  Let  L  denote  the  set  of 
local  extrema  and  E  the  set  of  endpoints  of  the  ridge  segments. 

For  each  element  of  LUE  falling  within  the  smallest  circumscribing 
circle  of  a  ridge  segment,  construct  a  normal  projection  onto  the 
segment.  Let  P  denote  the  projection  point  set.  Now,  let  the  points  in 
LUEUP  seed  a  Delaunay  triangulation.  Each  element  of  LUEUP  has  an 
associated  elevation.  The  triangulation  tnerefore  defines  a  piece-wise 
planar  approximation  to  the  terrain  surface.  For  certain  applications, 
an  approximated  surface  must  fit  the  original  data  to  a  given  error 
tolerance.  If  this  error  bound  is  not  satisfied,  additional  vertices 
are  added,  and  the  triangulation  is  updated.  An  iterative  algorithm 
is  well  suited  for  updating. 

The  above  technique  assumes  that  the  line  segment  set  is  sparse  in 
the  plane.  A  good  solution  for  the  more  general  problem  of  triangulating 
any  planar  set  of  points  and  line  segments  is  giv.n  by  Lee  [13], 

A  Delaunay  triangulation  is  an  excellent  choice  for  the  terrain 
fitting  and  display  problem.  Triangles  with  very  small  angles  produce 
a  poor  computer  graphics  display.  Maximizing  the  minimum  angle  within 
the  Cri angulation  Insures  the  best  possible  visualization  of  the  data. 
Further  requirements  are  mat  concerning  speed  of  construction. 

(5)  Spatial  Pattern  Models 

The  Voronol  and  Delaunay  tessellations  have  been  extensively  used 
to  model  spatial  patterns  in  a  wide  range  of  fields  inc’uding  astronomy 
[3],  bio-mathematics  [1,18,24,35],  computer  science  [4,11,12,30-32], 
geography  [2,17,27],  meteorology  [36],  metalurgy  [6],  numerical 
analysis  [10,24,25],  and  packing  and  covering  [22,28].  We  will 
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briefly  consider  two  somewhat  representative  examples. 

Suppose  that  we  are  given  a  collection  of  sites,  where  each  site 
has  a  random  variable  associated  with  it.  Let  these  sites  seed  a 
Voronoi  tessellation.  The  statistical  dependence  between  (Voronol) 
neighboring  sites  may  be  specified  in  terms  of  the  Delaunay  edge 
length  between  sites  and  the  Voronoi  border  length  between  their  cells. 
See  Besag  [1]  for  details  and  references. 

Stearns  [34]  poses  the  following  problem; 

"A  domain  wall  in  ferromagnetic  materials  can  be  considered  as  a 
two-dimensional  membrane  which,  when  subject  to  an  r.f.  field,  will 
oscillate  in  a  manner  determined  by  the  boundary  conditions.  One 
possible  set  of  boundary  conditions  would  correspond  to  pinning  the 
wall  at  impurities  whose  positions  are  random  in  the  wall.  In 
describing  wall  motion,  we  must  know  the  area  distributions  of 
triangles  formed  from  three  impurity  sites.  These  triangles  will 
contain  no  other  impurity  pinning  points  in  their  Interior  and  will 
be  called  'good'  triangles.  What  is  the  probability  distribution 
of  the  areas  of  the  resulting  network  of  'good'  triangles  formed 
by  choosing  N  points  distributed  uniformly  in  a  given  area?" 

Miles  [21]  interprets  'good'  to  mean  Delaunay,  and  proposes  a  solution. 


5,  Discussion 

We  have  presented  two  algorithms  for  constructing  the  Delaunay 

triangulation  for  a  set  of  N  points  in  the  Euclidean  plane.  The 

first  algorithm  is  based  upon  a  divide-and-conquer  approach.  It 

runs  in  O(NlogN)  time,  which  is  asymptotically  optimal.  The  second 

algorithm  iteratively  adds  points  to  an  existing  triangulation, 

updating  the  triangulation  to  include  each  newly  Introduced  point  as 

2 

a  vertex.  Although  it  could  take  0(N  )  time  for  a  worst  case,  it 
runs  fairly  well  for  the  average  case. 
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Figure  2  Lawson's  example  showing  a  triangulation  over  four  co-circular 
points.  The  Voronoi  tessellation  is  shown  as  dashed  lines. 


Figure  3  Lloyd's  counter-example  to  Shamos  and  Hoey's  claim  that  a 

Delaunay  triangulatlqn  is  a  minimum  edge  length  triangulation. 
The  Voronoi  tessellation  (shown  as  dashed  lines)  indicates 
the  use  of  the  longer  diagonal  for  a  Delaunay  triangulation. 
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upper  enmang  tangent 


Figure  6  Illustration  cf  the  upper  end  lover  coonon  tangents  of  tvc 
convex  hulls. 


Figure  7  Illustration  for  the  merge  process.  r3old  lines  are 
new  Delaunay  edges  Introduced  by  the  ME£G£  algorltha. 


APPENDIX  rv.3 

A  SURVEY  OF  COLOR  VIDEO  FRAME  BUFFER  DEPLAY 
SYSTEMS  FOR  DESIGN  GRAPHICS  RESEARCH 


This  appendix  presents  the  results  of  a  comprehensive  color  display  system  survey 
conducted  at  the  General  Electric  Corporate  Research  and  Development  Center  during 
1980.  The  Information  contained  in  the  survey  report  should  be  highly  useful  in 
selecting  a  display  configuration  for  a  sensor  prediction  techniques  rerearch  system. 
It  should  be  noted  however,  that  this  survey  was  conducted  for  purposes  of  serving 
specific  needs  by  a  specific  General  Electric  organization  in  the  area  of  design 
graphics  research.  Hence,  much  of  the  author's  commentary  is  given  in  that 
context. 
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A  SURVEY  OF  COLOR  VIDEO  FRAME  BUFFER  DISPLAY  SYSTEMS 
FOR  DESIGN  GRAPHICS  RESEARCH 


PJL  AUtrtll 


INTRODUCTION 

Ttaa  Design  Graphics  Program,  whkh  it  pan  of  tha  Information  Tachnotogy  Bruch  of 
tha  Automation  and  Control  Laboratory  at  General  Electric  Corporate  Ramarch  and  De- 
valopmnnt,  hat  bats  working  with  a  DtAua  2000  series  color  vidao  frame  buffer  tytttra 
for  over  a  year.  Image  diiptayi  have  contittad  mottly  of  output  from  Syn  tha  Vision*  and 
MCvTE.BYUT  imagery  for  various  applications  pnmariN  oriented  toward  CAU'CAM  de¬ 
velopment.  Recently,  tome  new  and  very  exciting  frame  buffer  systems  have  been  developed 
by  various  vendors  and  are  being  offered  at  excellent  price/ performance  ratios.  With  the 
appearance  of  them  new  powerful  higher  resolution  systems,  it  was  deemed  necessary  that 
Design  Graphics  should  update  their  system  to  provide  a  more  suitable  environment  for 
color  video  computer  graphics  development. 

The  color  video  frame  buffer  marketplace  is  a  very  dynamic  environment  which  has 
made  information  gathering  a  difficult  tack  in  that  various  sources  will  often  have  different 
responses,  and  what  is  said  today  may  be  obsolete  tomorrow.  The  author  of  this  report  wel¬ 
comes  any  questions,  comments,  or  added  information  pertaining  tocoior  video  frame  buffer 
systems,  especially  from  those  people  who  have  had  experience  on  xnyof  the  related  systems. 


•Syntha  Vision  is  a  three-dimensional  modeling  system  developed  by  the  Mathematical  Ap¬ 
plications  Group,  Inc.  (MAGI)  of  Elmsford,  NY . 

t  MOV1E.B  YU  is  a  polygonal  display  program  primarily  used  for  movie  animation  developed 
in  the  Civil  Engineering  Department  at  Brigham  Young  University  in  Provo,  Utah. 


Manuscript  received  February  12,  1980 
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VIDEO  FRAME  SUFFER  DISPLAY  SYSTEMS  FOR  DESIGN  GRAPHICS  RESEARCH 
DeRoMont 

A  video  frame  buffrr  itkm  it  bvtkaUy  t  video  ratter  scan  display  driver  cemertd  around 
•  large  piece  of  memory  which  contain*  image  information. 


The  frame  buffer  tyitein  allow*  the  uter  to  update  and  read  bock  from  the  image  memory, 
while  at  the  tame  time,  the  entire  digital  image  information  it  sequentially  read-out  and 
converted  to  analog  video  s'gnalt  for  diipiay  on  a  TV  monitor  et  refreth  ratee  (from  30  to  SO 
timet  per  tecond). 
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I  MACK  MEMORY  AND  COLO*  MAP  ORGANIZATION 

r~t  m~mf  i1"  n - -“1 — ri — f  ‘-i  - rr  tlrr  i-itgi  iririrr  imimirt  iinttnilaltr 

the  MOM.  A  modular  MCI  low  of  memory  It  dedicated  to  an  entire  display,  such  that  om  Mt 
of  Information  rormpoods  to  a  single  pixel  location  on  tht  scrven.  The  number  of  memory 
modules  corresponds  to  the  number  of  bitt  of  Information  dedicated  to  r»ch  individual 
pint.  Tht  put!  bit  inform  woo  describee  m  odd rest.  in  r  color  map  is  hie. 


Tht  color  tabit  contiau  of  a  list  of  intensity  values.  Each  location  in  tht  table  contains 
rtiativt  inttnsitits  for  each  ltd,  green,  and  blue  color  gun.  During  tht  sequential  pixel  scan- 
out  (read),  thtM  values  art  taken  from  the  table  in  tht  order  that  the  image  memory  dictates, 
converted  to  analog  signals  then  to  video  for  display.  The  whole  image  mttnory  is  read  out, 
converted,  and  displayed  during  a  single  refresh.  There  are  some  variations  and  enhance¬ 
ments  to  this  basic  design  which  will  be  noted  in  the  system  descriptions. 
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The  system  pro earner  will  accept  commands  From  tht  host  computer,  translate  them, 
and  perform  tht  appropriate  operation*  to  the  inapt  memory.  Then  talk*  may  ranpe  in 
complexity  from  simply  *Ti tin*  a  pixel  to  enhancing  ail  tht  visible  edge*  in  a  digmred  photo¬ 
graph.  To  speed  up  the  execution  of  tome  of  them  processes.  many  vendor*  have  provided 
iptcia!  purpote  hardware  module*.  A  prim*  example  of  a  hardware  module  useful  for  Dt- 
tign  Graphic*  is  the  vector  genera  tor.  Some  vector  generator*  art  capable  of  rendering  vec¬ 
tor*  on  a  color  video  ditplay  at  rate*  of  up  to  16.000  vector*  per  ttcond. 

Many  of  tht  *y*t*m  processor*  can  alto  be  user  programmable  in  micro-code.  tier 
programmability  may  prove  to  be  a  nice  feature  for  Design  Graphic*  in  that  it  may  allow  us 
to  put  in  the  processor  capabilities  to  do  such  thing*  as  fast  parametric  surface  rendering  — 
an  operation  no  vendor  offer*  today. 

System  processor*  can  become  very  big  and  powerful  and  ihat  power  is  usually  reflected 
in  the  cost  of  the  frame  buffer  system.  In  this  survey,  a  lot  of  discussion  will  be  devoted  to 
the  system  hardware,  firmware,  and  software  facilities.  In  most  cases,  firmware  and  hard¬ 
ware  facilities  will  reflect  the  power  of  the  system  processor  and  related  hardware  modules, 
while  software  facilities  will  describe  code  that  exists  on  a  host  computer  and  is  often  FOR¬ 
TRAN  callable.  User  programmability  will  describe  capabilities  for  the  user  to  down  load 
or  directly  program  the  system  processor. 
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DCttCN  CftAMICSCON«IOBAflOMI 

Tht  mult—tt  :«  a  video  frame  toft*  sywem  for  the  Oeaiga  Graphics  Program 
not*  from  has*  scon-line  displayi  mi  inmnnive  a— adorn  to  sophiadcatad  surface  im- 
M*ry  tad  enhancement  peocsdurw.  Some  of  tht  «meid*rauoni  include: 

I-  2D  and  3D  model  space  control 

•  Viewport 

•  Window 

•  Rotation 

•  Transition 
'  Seating 

2.  Display  of  design  and  manufacturing  models 

•  Win  from*  drawing 

•  Sculptured  lurfacoi 

•  Complex  solids 

•  Finite  element  results 

•  Text  generation  and  control 

J .  Capabilities  for  various  image  generation  techniques 

•  Pixel  by  pixel 

•  Scan-line  via  run-length  encoding 
«  Polygon  and  area  flit 

4.  Ease  of  program  development  and  implementation 

•  FORTRAN  callable  software 

•  High  level  graphics  language 

»  SIGGRAPH  CORE  graphics  standard 

5.  Speed  and  ease  of  user  interaction 

•  Hardware  or  firmware  facilities  (i.e.,  room,  scroll,  or  vector  generation) 

•  Peripheral  devices  (i,e„  tablet,  light  pen  or  trackball) 

6.  Animation  capabilities  (i.e.,  cutter  path  or  assembly  visualization) 

•  Run-length  decode  facilities 

•  Image  memory  and  color  map  controls 
«  Video  disk  facilities 

7.  Image  enhancement  capabilities 

•  Edge  detection 

•  Anti-aliasing 

•  Contrast  enhancement 

•  Hue,  intensity,  and  brightness  controls 
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PMUMNARY  DESIGN  GRAPHICS  UQUBEMENTS 

The  tawwi  co— ^deration  involved  in  the  Mtecuon  of  *  vidbo  frame  buffer  system  u 
the  capability  of  displaying  the  damrabk  imagm  (SymhaVision,  MOVIE.BYU.  Sculptured 
Surfaces,  etc.)  in  a  more  effective  manner  than  the  existing  DeAnza  103000  tenet  system. 
The  most  obvious  improvement  would  be  the  increase  in  image  resolution  from  256  x  256 
(medium  resolution)  to  512  x  S  1 2  (high  resolution)  or  to  102*  x  I024(ultra-high  resolution). 
Another  improvement  that  could  significantly  enhance  image  display  is  the  increase  of  the 
color  table  depth  to  allow  for  a  greater  variation  of  intensities  over  the  range  of  a  specific 
hue.  In  particular,  changes  across  a  curving  surface  would  appear  much  smoother,  and  the 
effectivity  of  various  lighting,  texture,  edge-smoothing,  and  anti-aliasing  models  would  be 
greatly  enhanced. 

To  restrict  the  survey  to  those  vendors  that  could  successfully  fulfill  our  needs,  prelim¬ 
inary  requirements  for  selecting  a  video  frame  buffer  display  system  are  described: 

1 .  A  refresh  display  system  to  allow  continual  visual  feedback  of  image  updates. 

2.  A  workable  interface  to  the  PDPl  1/70 and  VAX  computers. 

3.  Minimum  of  512  x  S 12 pixel  resolution. 

4.  Minimum  of  8  bits  of  information  per  pixel,  thus  allowing  256  colors  to  be  displayed 
simultaneously. 

5 .  Minimum  of  5  bits  per  color  gun  in  the  color  map. 

Note  that  this  survey  does  contain  some  systems  that  do  not  satisfy  requirement  5  These 
systems  were  included  for  completeness  in  that  some  people  might  choose  to  live  with  the 
4  bits  of  color  intensity  if  the  system  is  extremely  successful  in  regard  to  other  Design 
Graphics  considerations. 
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SYSTEM  CATEGORIZATION 

In  terms  of  Dmiga  Graphics  reqairemeac*  *a nm  lyrnsw  design  wsu  price  criteria  there 
seem  to  he  4  bosk  types  of  frame  buffer  systems  available. 

1.  High  resolution  (SI2  x  512) 

2.  Ultra-high  resolution  (1024  x  1024) 

1.  High  resohmon  upgradable  (modular)  to  ultra-high  resolution 

4.  Image  processing  systems  for  both  high  and  ultra-high  resolutions 

In  some  cases,  there  is  difficulty  in  discerning  the  image  processing  systems  (4)  from  the 
image  display  systems  (I,  2.  and  3)  since  some  systems  try  to  bridge  that  gap  to  entice  both 
markets.  However,  this  is  to  our  advantage  in  that  the  Design  Graphics  considerations  pre¬ 
viously  discussed  do  reflect  a  need  for  some  image  processing  capabilities  such  as  edge  de¬ 
tection  and  anti-aliasing.  Therefore,  I  have  classified  at  image  processing  oniy  those  systems 
for  whkh  a  high  price  would  be  paid  for  powerful  facilities  that  would  do  little  to  satisfy 
our  needs. 

A  major  point  of  contention  pertains  to  the  high  versus  ultra-high  resolution  decision. 
After  some  market  study,  it  appears  that  the  ultra-high  resolution  systems  are  not  quite  ready 
for  consumption.  Most  vendors  believe  that  the  ultra-high  resolution  sy items  will  not  really 
be  effective  until  the  64k  chips  become  readily  used.  The  significance  of  this  probably  will 
not  be  felt  in  the  frame  buffer  mantel  for  about  2  or  3  years.  Many  hardware  end  firmware 
faciities  offered  with  the  high  reaolutioa  systems  simply  arc  net  available  with  ultra-high 
resolution.  Peripherals,  particularly  video  recording  hardware,  simply  does  not  exist  Tor 
1004  x  1024  displays.  Finally,  software  that  was  developed  Tor  high-resolution  pixel  data 
executes  much  slower  and  often  demands  mote  host  memory  on  ulus-high  resolution  sys¬ 
tems.  Op  the  other  hand,  the  high  reaolutioa  systems  offer  a  very  cost  effective  solution  in 
that  it  is  the  same  resolution  of  standard  television.  Thus,  most  of  the  video  equipment  re¬ 
lated  to  the  system  has  bacn  in  use  for  yaart  and  is  greatly  refined.  It  is  ciso  relatively  less 
expensive.  The  high  resolution  frame  buffer  systems  have  aia&'  been  in  use  for  many  yean 
so  that  there  arc  now  many  viable  vendors  who  ue  operating  in  a  very  competitive  market¬ 
place.  All  these  considerations  make  it  pretty  apparent  that  we  should  be  directing  ounclves 
toward  the  high  resolution  systems  while  keeping  a  clot-  eye  on  the  ulua-high  resolution 
market  dcvHoptnsnts. 
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VENDOR SURVEY 

This  Initial  survey  was  specifically  designed  to  obtain  the  general  system  conrigurations 
and  capabilities  in  order  to  determine  what  types  of  systems  satisfy  our  needs,  to  what  ex 
tent,  and  at  what  cost.  A  number  of  decisions  have  already  b.en  made  regarding  the  minimal 
requirements  and  the  high  versus  ultra-high  resolution  question.  We  can  now  cut  through  a 
lot  of  extraneous  information  and  concentrate  on  the  system  configurations  that  really  con¬ 
cern  Design  Graphics. 

For  each  system,  pertinent  information  has  been  extracted  and  entered  on  a  form  sheet 
w,  ,ch  can  be  found  in  Appendix  A.  Following  each  form  sheet  is  a  system  diagran  pro¬ 
duced  by  the  vendor  (if  one  was  available).  Note  that  all  technical  and  cost  information  has 
been  based  on  h-gh  resolution  (512  x  512)  systems.  The  basic  system  cost  refers  to  a  simple 
system  containing: 

•  512  x  5 12  Image  memory  with  at  least  8  bits  depth  at  each  pixel 

•  System  processor 

•  Interface  to  host  computer  (PDP  1 1/70) 

•  19-Inch  color  monitor  ($4000  if  not  offered  directly) 

Added  to  the  basic  system  cost  are  the  optional  peripherals  and  facilities  that  would  help 
satisfy  our  needs  resulting  in  a  total  system  cost. 

The  table  following  this  section  is  an  attempt  to  squeeze  the  tables  from  Appendix  A 
onto  a  single  sheet  of  paper  to  give  the  reader  a  very  generalized  overview  of  the  systems  sur¬ 
veyed.  Some  of  the  things  to  look  for  are: 

Approximate  Total  Price:  How  much  system  power  is  needed  in  relation  to  the  time  of 
next  foreseeable  purchase? 

Memory  Conflguralioa:  Modular  bit  planes  (5 12  x  512  X  I )  are  preferred  to  allow  a  more 
flexible  initial  purchase  with  easy  lower  cost  upgrading  later  Remember  the  8  bits  depth 
per  pixel  requirement  (no.  4,  page  7)  mentioned  earlier. 

Pixel  Access:  The  time  required  to  read  or  write  a  pixel  is  very  important  in  user  inter¬ 
action  and  critical  for  repeated  operations. 

Color  Map:  The  greater  the  number  of  intensities  allowed  per  Red-Green-Blue  color 
gun,  the  smoother  a  color  change  can  be  made.  Note  the  5  bits  per  color  gun  require¬ 
ment  (no.  5.  page 7).  Also,  the  longer  the  color  map  (the  number  in  parenthesis),  the 
greater  the  number  of  colors  that  can  be  displayed  on  the  screen  for  a  single  image  if  the 
corresponding  number  of  bits  per  pixel  are  provided. 

Programmable  Processor  May  allow  the  user  to  do  specialized  fast  processing  of  image 
generation. 

Hno*  Interface:  Look  for  DMA,  because  extended  memory  configurations  could  cause 
troubles  going  to  the  VAX  or  other  host  computer.  Also,  extended  memory  systems 
that  work  on  other  PDP  1  Is  are  more  difficult  to  implement  on  the  11/70. 
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Desirable  Facilities-  Purely  a  personal  reaction  to  the  hardware,  firmware,  and  soft¬ 
ware  facilities  a  system  has  to  offer.  The  re?  der  should  consider  the  requirements,  ex¬ 
amine  Appendix  A,  and  establish  his  own  reaction  about  the  desirable/offered  facilities 
1424  x  1024  Display  Upgrade:  If  there  is  a  strong  near-term  desire  to  display  images  at 
ultra-high  resolution,  even  for  just  test  purposes,  we  should  definitely  value  this  category 
as  a  very  high  priority  item.  If  we  want  to  hold  off  for  2  to  3  years  until  more  viable 
systems  are  available,  this  factor  can  be  considered  in  very  low  priority. 
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•MET  SYSTEM  COMMENTARIES 


Thts  section  contains  a  commentary  on  each  video  frame  buffer  system  in  which  I  will 
try  to  emphasize  the  strong  and  weak  points  of  each  system  niativt  to  tht  ntrds  of  Design 
Graphics. 

DeAaaa  IDS000  is  the  big  brother  of  the  DeAnaa  2000  now  in  use.  Relative  to  the  other 
newer  systems  in  this  survey,  most  commentaries  must  be  directed  to  the  system's  short- 
cominis  which  include: 

•  Limited  memory  configurations 

•  Restricted  and  slower  memory  access 

v  Only  4  bits  of  intensity  per  RGB  in  the  color  table 

•  Very  limited  faciities 

The  only  real  positive  thing  about  the  DeAnza  103000  is  its  low  price. 

AED  SI2  was  one  of  the  big  stars  of  the  SIGGRAPH  '79  vendor  exposition.  It  can  act 
asa  reasonably  powerful  pixel  addressable  frame  buffer  display  system,  or  as  a  sophisticated 
character  oriented  graphics  terminal  and  also  comes  with  a  Tektronix  emulator.  In  terms  of 
price  versus  performance,  the  AED  312  is  very  hard  to  beat  in  the  lower  price  range  of  tnis 
survey.  The  only  limitations  (within  the  lower  price  range)  that  I  see  are  the  maximum  of 
8  bits  depth  per  pixel. 

Three  Riven  CVD  is  basically  designed  for  the  play  back  of  animation  sequences.  The 
major  limitation  of  its  Run-Length-Encodcd  (RLE)  based  memory  is  that  man-  color  vari¬ 
ations  across  a  single  scan  line  cannot  be  executed.  Smooth  surface  imagery  generated  by 
SynthaVision  or  sculptured  surfaces  would  have  to  be  approximated  resulting  in  low  quality 
imagery.  The  CVD  docs  not  even  attempt  to  utilize  most  of  the  facilities  we  need  and  can 
only  display  64  colors  at  one  time  (equivalent  to  6  biu  of  depth  per  pixel). 

Grind)  GMR27  is  a  pretty  good  low  cost  image  display  type  of  frame  buffer  including 
a  1024  long  color  map.  However,  the  GMR27  lacks  the  facilities  of  some  newer  comp  -able 
systems. 

GrineH  GMR27S  is  the  image  processing  version  of  the  GMR27.  Basically,  we  would 
be  paying  more  for  the  GMR270  than  the  GMR27  for  image  processing  facilities  ,hat  we 
have  little  use  for. 

ADI  Ughl-50  is  a  newcomer  to  the  frame  buffer  marketplace  but  it  appears  to  be  quite 
viable  and  it  is  the  only  system  that  includes  a  NTSC  encoder.  The  RGB  intensity  control  in 
the  color  map  is  somewhat  limited  and  memoiy  planes  can  only  be  acquired  in  512  x  312  x  4 
units.  However,  the  system  does  provide  a  way  for  the  use:  to  down  load  micro  code  to 
the  TMS  9900.  At  this  point  in  time,  the  ADI  Light-30  has  not  yet  beer,  tested  as  a  consumer 
product  since  most  recent  efforts  have  involved  software  interfacing  with  CHILD,  Inc. 

LcxMata  3400  is  a  very  flexible  modular  system  that  offers  some  nice  facilities  in  a  pretty 
comfortable  price  range.  Some  of  the  options  include: 

•  Integer  zoom  controller 

•  Multiple  scroll  controllers 
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•  1024  Long  color  table  with  8  biu  per  RGB 

•  Keal-timt  edge  smoother  for  2X  and  4X  zoom 

•  Multiple  overlays 

Genkco  GL  r-3000  does  offer  some  good  facilities  at  a  fairly  reasonable  price.  However, 
aside  from  the  Grafnac  II  software  package,  the  system  offers  little  more  than  the  A£D  512 
which  costs  almost  half  as  much;  and  the  GCT-3000  cannot  compete  with  some  of  the  higher 
price  systems  in  terms  of  processor  power.  Considering  this  middle-market  price/perfor¬ 
mance  position  Genisco  has  taken  along  with  rumors  of  hardware  unreliability,  it  would  be 
more  judicious  for  Design  Graphics  to  avoid  the  GCT-3000. 

DeAan  VC50M  looks  like  an  attempt  to  move  away  from  the  norm  of  frame  buffer 
system  duign  ...  but  not  in  out  direction.  The  user  has  to  buy  a  312  x  512  x  to  image 
memory  configuration  and  at  the  same  time  be  limited  to  only  4  bits  of  intensity  per  RGB 
color  gun.  There  is  a  reasonable  vector  generator  and  character  controller,  but  little  else 
in  the  way  of  facilities  . . .  and  a  relatively  high  price  tag. 

Aydia  5214  is  the  system  to  bea.  in  terms  of  hardware,  firmware  and  software  facilities 
which  include  ID  object  transformations  and  Z-sort  hidden  surface  removal.  The  system 
even  offers  a  user  programmable  Intel  8046  with  one  Megaword  of  memory  and  the  Forth 
programming  language.  There  is  a  very  long  (2048)  color  map,  and  with  an  extra  video  card 
(coating  about  13000)  can  offer  8  bits  depth  of  intensity  per  color  gun  as  a  non-standard 
configuration.  However,  at  this  point  in  time,  the  software  packages  arc  not  yet  complete. 

Norpak  VDP  is  a  Canadian  company  that  is  new  to  the  high  level  frame  buffer  market. 
They  offer  many  nice  features  but  it  seemed  as  though  every  time  I  wanted  to  get  some  de¬ 
tail,  I  got  a  response  like  “well ...  its  not  quite  complete  yet."  For  that  kind  of  money  I'd 
want  to  see  it  completed  and  tested  before  buying. 

DeAaaa  IPS888  is  good  image  processing  system  at  a  reasonable  price,  but  really  quite 
overpriced  for  Design  Graphics  use. 

Ikoaas  is  probably  the  best  frame  buffer  system  or.  the  market  to  do  computer  graphics 
image  display  research  on.  The  possibilities  for  their  32  bit  microprocessor  along  with 
some  of  the  built-in  hardware  facilities  are  really  quite  interesting.  Ikorus  claims  to  have  de¬ 
veloped  a  system  that  is  modular  enough  to  keep  (hem  on  top  of  the  research  graphics 
market  for  at  least  -be  next  few  years.  Engineers  from  other  frame  buffer  companies  say  to 
look  for  good  things  in  the  near  future  from  Ikonas. 

Ram  Mi  MM  probably  has  the  best  vector  generator  in  the  market  (although  Aydin’t  un¬ 
tested  hardware  shows  promise).  It  also  offers  some  other  nice  facilities  like  2D  rotation, 
entity  detection  and  down  load  list  processing.  For  an  additional  54100  of  hardware  they 
will  also  offer  8  bits  of  color  depth  intensity  per  RGB  which  is  non-standard. 

C— mi  Vle»—  One/ 28  is  the  top  of  the  line  for  image  processing  systems.  They  offer  a  real¬ 
time  pen  of  «40Sg  x  4098  x  (bit  imageandamovic capability  to  viewinga  512  x  512  x  512 
bit  array.  U  is  basically  a  very  powerful  high  priced  system  to  perform  operations  we  do  not 
really  need. 
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RECOMMENDED  SYSTEMS 

In  temw  of  price/ performance  criteria,  I  see  six  video  frame  Puffer  systems  that  stand 
out  from  the  others.  In  order  of  approximate  price  they  are: 

AED  512  <19,820 

ADI  Light- 50  $23,200 
Lexidatu  3400  $25,800 
Aydin  52 16  $42,500 

Ikonas  $48,000 

Ramtek  $53,100 

The  six  systems  seem  to  fall  into  two  price  rantes: 
lower  price  range:  $19,820  to  $25,800 
higher  price  range:  $42,500  to  $53. 100 

The  lower  price  range  systems  are  good,  fast  modular  systems  that  will  satisfy  our  needs 
quite  nicely.  The  higher  price  systems  are  quite  similar,  except  that  they  have  much  bigger 
processors,  mor-  hardware  facilities  and  are  upwards  compatible  to  the  ultra-high  resolu¬ 
tion  displays. 

In  (he  higner  price  range,  Aydin  and  Ramtek  both  provide  the  best  hardware  support 
modules,  and  extensive  firmware/software  facilities-  Ramtek  seems  to  offer  a  superior  vec¬ 
tor  generator,  but  Aydin  docs  support  3D  object  transformations  and  offers  an  easily  pro¬ 
grammable  micro  which  could  certainly  prove  valuable  for  surface  generation.  One  the 
other  hand,  Ikcnas  provides  a  more  state-of-the-art  techaotogy  with  the  32  bit  200  Nsec 
cycle  processor  with  fast  hardware  multiplier  and  various  special  purpose  hardware  facilities 
including  3D  transformations.  Ikonas  also  has  its  image  memory  configured  in  such  a  way 
that  an  ultra-high  resolution  upgrade  would  only  require  a  monitor  change  and  setting  a 
software  switch.  All  other  available  systems  require  some  hardware  changes.  So  at  the 
higher  price  range  it  comes  down  to  a  preference  between  Aydin's  already  developed  facilities 
to  Ikonas’s  more  advanced  engineering. 

At  the  lower  price  range,  the  task  of  selecting  a  better  system  becomes  more  difficult. 
In  my  opinion,  either  (he  AED  5 1 2,  ADI  Light  50  or  the  Lexidata  3400  would  be  good  selec¬ 
tion  that  would  stand  us  m  good  stead  for  the  next  two  to  three  years.  To  help  with  this  de¬ 
cision,  I  believe  that  we  should  look  further  into  any  special  deals  or  company  relationships 
that  might  provide  more  incentive  one  way  or  another.  The  following  section  will  d>scu  s  a 
suggested  approach  to  making  the  final  selection. 
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THE  NEXTSTEP 

I  believe  Use  not  logical  ittp  would  bt  to  evaluate  our  present  needs  and  plact  into  per- 
rpective  our  near  um  goalt  and  our  long  term  goals.  Using  that  criteria,  we  should  de- 
tannine  whether  we  need  to  put  down  the  money  for  a  high  powered  system,  or  whether  we 
can  satisfactorily  pursue  our  goals  with  a  lower  coat  system,  la  either  case,  the  following 
questions  should  be  asked  of  each  critical  vendor  and  responses  requasted  in  writing  along 
with  a  formal  detailed  quote: 

1.  What  is  the  delivery  lead  time? 

2.  What  arc  the  levels  and  coats  of  factory  and  fleldiervtee? 

3.  What  Is  the  actual  mechanical  packaging  (l.e.,  parts  supplied)? 

4.  Are  there  any  special  company  relationships  to  consider? 

3.  Are  there  any  special  price  cuu  to  consider? 

i.  What  are  the  planned  future  capabilities  and  to  what  extent  or  coat  will  they  be  avail¬ 
able  to  us? 

The  responses  to  lhaee  questions  may  well  provide  the  thrust  to  select  one  vendor  over 
soother.  For  example,  Crmigraphlc*  received  a  handsome  discount  from  Lexidau  with 
the  intent  of  future  quantity  buying.  Some  of  the  vendors  suggested  a  company  contract 
that  would  allow  us  to  utilise  their  software  package*  on  many  in-house  systems  for  one  set 
price.  Ikonas,  for  one,  has  stated  that  it  is  virtually  impossible  to  deliver  s  system  before 
19M.  We  must  understand  all  these  factors,  evaluate  them,  and  then  compare  the  trade¬ 
offs  in  order  to  make  a  final  selection. 
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OTHER  HARDWARE  TO  CONSIDER 

Thor*  arc  tout  othar  devices  related  to  video  frame  buffer  systems  that  may  help  jatiify 
tome  of  our  requirements.  They  all  represent  way*  by  wtiich  we  may  record  our  images 
for  communication  or  special  applications. 

NTSC  Encoder* 

The  purpoee  of  an  NTSC  encoder  is  to  transform  the  RGB  video  signals  that  are  dis¬ 
played  on  high  quality  monitors,  to  standard  television  signals  so  we  may  record  i mater/ 
directly.  1  found  only  two  encoders  that  told  for  under  $3000. 

Leans  lac.  offers  NTSC  encoder  that  “was  sped  deal  ly  destined  to  encode  high  resolu¬ 
tion  color  graphic  computer  displays  irrcgardleu  of  scan  rates."  They  do  claim  that  it 
will  encode  1024  x  1024  resolution  displays.  Cost:  $1393 

VMM  Modular  Systems  offers  an  NTSC  encoder  that  does  not  presently  handle 
1024  x  1(04  resolution  displays,  but  they  suggest  that  they  will  have  that  capability  in 
2  to  3  months.  Cast:  $940 

VMtnDMa 

A  video  disk  is  a  disk  unit  especially  designed  to  store  end  play  back  video  images.  It  is 
presently  being  used  quite  successfully  for  computer  animation  in  that  it  allows  the  storing 
of  images  at  a  slow  rate  and  will  play  them  back  at  a  real-time  rate.  This  allows  for  a  much 
more  flexible  recording  system  than  the  traditional  movie  frame-by-frame  photography 
method  because  there  is  no  wait  for  film  processing,  and  also  because  the  animator  may 
selectively  edit  random  frames.  The  major  problem  with  video  du,.s  is  (hat  most  of  them 
use  laser  technology  making  them  quite  expensive.  A  price  tag  above  $100,000  is  not 
unusual  for  a  good  digital  system.  However,  there  are  some  alternatives. 

Ofctal  offers  an  analog  video  disk  system  for  $40,000  which  is  being  used  in  various 
places  including  Cornell's  lab  for  Computer  Graphics.  As  I  understand  it,  the  analog 
nature  of  Oktal's  system  requires  so  much  tweeking  that  a  video  engineer  should  be  on 
hand  most  of  the  time. 

Eigen  Video  recently  announced  a  lower  Quality  low  cost  solution  in  the  form  of  a  mag¬ 
netic  disk.  The  monochrome  recorder  costs  about  $16,000,  and  the  additional  time  base 
corrector  for  color  recordirg  boosts  that  system's  cost  to  $24,300.  The  Eigen  system 
cun  record  up  to  300  frames  which  is  good  for  about  10  seconds  of  animation.  The  mag¬ 
netic  cassettes  list  approximately  100  hours  before  they  must  be  rebuilt  at  a  cost  of 
$10 ea  h. 

The  Gi  television  development  group  in  Portsmouth  is  looking  into  video  disks  and  are 
planning  on  buying  one  alteady  develonid  elsewhere.  If  animation  is  a  definite  requirement, 
1  would  teccmmenci  finding  it  at  a  system  level  because  the  costs  of  video  disks  are  so  high. 
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Soma  framt  buffer  systems  now  or  toon  will  offer  disk  control krt  in  combination  with 
hardware  run-length  decoders  which  may  well  utitfy  mott  play  back  animation  speed 
requirements.  A  lower  coat  tohition  would  be  the  Three  Rivers’  CVD  frame  buffer  tytttm 
which  is  specially  built  for  animation  (discussed  earlier  in  this  survey).  It  could  be  pur¬ 
chased  as  a  second  frame  buffer  for  SI  5.  COO  end  used  totally  for  animation  display. 

Hard  Capy 

At  this  point  in  time,  color  video  hard  copies  ere  hard  to  find  in  a  reasonable  price  ranee 
relative  to  their  monochrome  counterparts.  The  most  notable  systems  available  today  are: 
Trilag  lac.  offers  a  system  called  COLORPLOT  100  which  is  bated  on  a  impact  printer 
costing  S9900  It  produces  a  copy  with  100  dots  per  inch  vertical  and  horizontal  resolu¬ 
tion  in  about )  minutes  costing  about  Sc. 

Dana  Instruments  has  a  hard  copy  unit  that  utilises  a  Polaroid  camera  to  make  high 
quality  I  *  10  color  photographs  at  about  $5.00  per  picture.  The  system  will  also  allow 
for  35  mm  slides  to  be  taken  and  costs  about  $16,000. 

Matrix  Instruments  produces  a  hard  copy  system  very  similar  to  the  Dunn  but  with  a 
basic  system  cost  of  about  $12,100.  It  has  the  additional  capability  of  formatting  mul¬ 
tiple  images  (2,  4,  6,  9,  25)  on  a  single  8  x  10  Polaroid  print  which  could  result  in  sub¬ 
stantial  film  cost  savings.  Unfortunately,  each  formatter  costs  $1000.  Additional  for¬ 
matters  arc  available  for  35  mm  slide  ( I -image-53000)  and  microfiche  (92  images-$l500). 
The  total  cost  for  a  good  system  is  about  $20,000. 

Xerox  makes  a  color  copier  that  will  accept  serial  computer  data  and  output  a  100  dot 
per  inch  image.  The  system  can  also  produce  35  mm  slides  and  can  operate  in  the 
normal  copying  format.  Nice  system  for  about  $25,000. 

AppUcon  now  advertises  an  ink-jet  plotter  for  about  $40,000  that  will  make  some  nice 
color  copies.  Some  examples  of  the  ink-jet  plotter  output  are  on  the  wall  in  the  Design 
Graphics  Lab. 
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LOW-COST  CO  LOS  VIOCO  DISPLAY  SYSTEMS 

At  this  poiM,  I  would  Ukt  to  note  that  tom  of  the  system  olroady  mentioned  my  won 
satisfy  tho  tow<oM  taquiitments  white  offerina  upward  configuration  possibilities  in  a 
modular  fashion. 

For  uampte,  Advaooad  Etectronio  Design,  Iik.  (AED)  can  confiture  a  high  resolution 
(312  x  312)  system  with  3  bits  of  depth  at  each  pixel,  POP  1 1  interface,  14-inch  color  mon¬ 
itor,  powerful  firmware  capabilities  along  with  a  Tektronix  Piot-10  emulator  for  under 
$13,000  —  not  indudiag  quantity  discounts.  With  2  bits  of  depth,  the  cost  is  test  than 
$12,300. 

Applied  Dynamics  International  (ADI)  can  put  together  a  hi|h  resolution  (312  x  312) 
system  with  4  bits  of  depth  at  each  pixel,  PDF  1 1  interface,  14-inch  color  monitor,  NTSC 
encoder,  power  firmware  and  a  Tektronix  emulator  (Tek-Light)  with  some  nice  extensions 
for  around  $16,000.' 

These  systems  represent  the  upper-end  of  the  low  coet  frame  buffer  spectrum,  but  they 
do  offer  some  very  nice  features  in  a  very  cost  effective  manner. 
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TABLES  AND  DIAGRAMS  DESCRIBING 
COLOR  VIDEO  FRAME  BUFFER  DISPLAY  SYSTEMS 
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SYSTEM 

DeAnza  10S000 

CONTACTS  Rick  pizza  Chuck  Nordby 

San  JoM,  CA  (406)  263r?155 

IMAGE  MEMOHY 

CONFKJ'JRADONS 

AND  COSTS 

2  basic  color  systaas 

512  x  512  x  8  *  color  output  6  bit*  (3-F  3-c  2-B) 

512  x  511  x  32  •*  color  output  12  bita  (4  aach  color) 

FROCEESOR 
(SEC  ATTACHED) 

MEMORY  ACCESS 

ONE  PIXEL  VO  AFTER  INITIALIZATION  1:2  Ma  (average  1.8  Ha) 

Manory  allocatad  lina  by  line  along  DMA  interface 

REFRESH  RATE 

INTERLACE 

10  Hz 

2:1 

HOST 

interfaces 

PDF  11/70  "Standard  PDP-11  Unibua  intarfaea"  -  Unibue  registers 

VAX  VMS  I/O  Driver  $650  or  DMA  Board  - 

peripherals 

Joystick  $875 

FACILITIES 

h-haadware 

F-FIRMWARE 

5- SOFTWARE 
U-USER/PROCESSOA 

H 

S 

H 

H 

Color  Table  256  x  12  (4  par  gun)  for  512  x  512  x  B  Syetaai 

1024  x  12  14  par  gun)  for  512  x  512  x  12 

VDLIB  -  Dec.  11  coapatibla  $250 

-  Magnification  function  -  real  valua  -  interpolated 

-  Polygon  fill 

-  user  interaction 

Cursor 

Alpha  -  Nuaieric  Saner* tor 

MONITORS 

NTSC  ENCODER 

HARD  CORY  FACILITY 

Recoeatend  Dunn 

COST  OF  SASIC  SYSTEM 

DESIRED  EXTRAS 

TOTAL  COST 

512  x  51?  x  12  Syatai  to  allow  4  bits  par  RGB  -><$18,500 

VDLIB,  Joystick  $19,525 

COMMENTS 
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SYSTEM 

A£D  512 

IMAGE  MEMORY 

CONFIGURATIONS 
AND  COSTS 


CONTACTS  Gary  Wilson  (Sales)  Pet*  Karri*  IEng.) 

_ Bedford,  Hass.  1617)  275-6401  Sunnyvale,  CA  1406)  733-3555 

512  x  512  x  1  bit  plana*  -  Max.  3  $885 

(possible  CO  hook  uo  3  set*  of  8) 


PROCESSOR 
(SEE  ATTACHED) 

MEMORY  ACCESS ' 


6502A  Micro  *  no'  ustr  pr  jgrammabl* 

ONE  PIXEL  I/O  AFTER  INITIALIZATION  Una  -  9c  to  100  Ms  inituT 
-  30  M*  subsequent  -  1  Mn  RLE  -  5  Ms  per  pixel 


REFRESH  RATE 

INTERLACE 

HOST 

INTERFACES 

PERIPHERALS 


POP  11/70  DMA  Unibus  Interface 


Keyboard  wish  numeric  pad  and  joystick  (included) 


FACILITIES 

H-HARDWARE 

FFIRMWARE 

S-SOFTWARE 

U-USER/PROCESSOR 


F  Included:  Vector  Generation  -  9  M»/pixel  after  initial 
Scroll 

Zoom  2x.  3x,  4x.  5x  . . .  16x 
Polygon  fill  -  after  vector* 

Area  fill 

Pun  Length  Encode  and  Decode 

Cursor  -  Joystick  control 

"Area  of  interest"  -  similar  to  window 

Circle  generator 

H  Color  table  256  x  24  (8  per  gun) 


MONITORS 


NTSC  ENCOOER _ 

HARO  COPY  FACILITY 
COST  OF  BASIC  SYSTEM 
DESIRED  EXTRAS 
TOTAL  COST 
COMMENTS 


N/A  Yet _ 

Working  on  Applicon  and  Hard  Disk  interfaces 
512  x  512  x  8 

included 


Tektronix  mulation  mode  -  Unmodified  Plot-10  (4000) 
Included 

No  character  generator 

Working  on  floppy  disk  interface  to  unibus  $4,000 


CONJURATIONS 

ANO  COSTS 

Mamory  acta  aa  add-on  to  PDP-11  Maory.  It  la  aaich  amallar  than  othar 
image  aamoriaa  alnca  diaplay  information  la  contacted  tc  RLE  format. 

Control!  Run-Langth  encoding  and  daooding. 

MEMORY  ACCESS 

ONE  RIXEi  I/O  AFTER  INITIALIZATION 

-< 


W 

fc! 

fv! 

t'i 


l 
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SYSTEM 

Grin* 11  OMR-270 


IMAOI  MEMORY 


CONFIGURATIONS 
AND  COSTS 


I  CONTACTS 

1  (AON)  2*3-9920  San  Jos* 


512  a  512  x  1  bit  plana* 

quantities,  1  •  3S00  2  -  51200  3  •  51600  4  -  52000 
(Max  32  planes) 


(SEC  ATTACHED) 


Not  User  Proqrf  able 


MEMORY  ACCESS  GN6  PIXEL  WO  AFTER  INITIALIZATION  6  Mxae  first  pixel 

1.5  Haac  subsequent  pixels 


REFRESH  RATE  I  30  Hz  60  Hi 

INTERLACE 


FOR  11/TO  Gat  Mils  free  Dae  -  Interface  Logic 


VAX  similar  - 


FACILITIES 

H-HAROWARE 
F"1  'WARE 
S  /WARE 
>udCR(PROCE3SOR 


H  Zoca  (2x,  4x,  Bx)  and  Pan  $1,200 

Kith  cursor  to  denote  screen  center 
H  (3)  256  x  24  color  tables  (use  only  one  at  a  time)  $1,600 

H  Image  runction  Neeory  Card  $1600  (Video  Driver  card  $1200) 

-  3  1024  x  S  color  tables 

-  capability  for  split  screen  and  ieage  toqgllnq 

H  Image  Processor  Card  (oiltiply,  divide..,)  $2,200 

H  Image  Analyser  Card  (histograms-..)  $1,400 

i  window  read  and  write  control  included 


buy  and  sell  conracs 


NTSC  ENCODER 
HARO  COPY  FACILITY 
COST  OF  BASIC  SYETT 
OCSiRXD  EXTRAS 
TOTAL  COST 
COMMENTS 


oce/par,  Image  Function  Neeory  Card, 
■video  Drive  Card,  Joystick,  Trackball 


$15,000 


Vidro  Digitising  Option  $1200 


Graphic 

Overlay 


SYSTEM 

ADI  -  Light  50 

IMAOC  MEMORY 


CONFIGURATIONS 
AND  COSTS 


ICONTACTS 


Harold  F.  Clearwater*  (Main) 
Boh  Ray  -  local  talesman 
Low  11 ,  MA  '617)  455-2576 


512  x  £12  x  4  board  -  Hax  4  now,  6  future 
(1024  x  1024  display  net  announcad  yat) 


PROCESSOR  16  bit  Micro  -  TMS  9900  -  U*ar  can  download 

ATACHE0)  6k  ROM  £  4k  RAM  which  can  be  incraaand 

MEMORY  ACCESS  ONE  RIXEL  I/O  AFTER  INIT1AUZATION  1.2  Ms 


REFRESH  RATE 

INTERLACE 

HOST 

INTERFACES 

PERIPHERALS 


POP  11/T0  intarfaca  w/micro  ■*  hoat  attachment  ♦  boat  I/O  but  $2,500 
VAX  M/A  ” 


Keyboard  with  nimaric  pad  £  16  function  switches 
Joystick 


FACILITIES 

H-HAROWARE 

FFIPWWARE 

S-SOFTWARE 

U-USENPROCESSOR 


H  1  pix  (vartical)  by  16  pix  (horironcai)  scroll  and  room  2x,  4x,  8x 

$500 

H  Fast  Elsment  Canarator  (fill  2.5  Ma/pixal) (vectors  1.3  Mx/pixali 

$3,000 

H  Graphics  Overlay  -  RS170  (casiara  innut)  (512  x  512  x  1  piano) 

$2,500 

H  Color  table  256  x  16  (standard)  1024  x  16  (optional  -) 

(5-rad  6-blue  5-green) 

F  Included 

-  arbitrary  (real)  scaling  lx  to  256x  -  must  rabui Id  image 

-  generate  circles,  area,  characters,  rectangles,  conics 
•  area  fill  and  rectangle  fill 

-  cross-hair  cursor 

-  multiple  views  with  a  2D  window  (function  of  soon  6  scroll) 


MONITORS 


NTSC  ENC00ER 
HARO  COPY  FACILITY 
COST  OF  SASIC  SYSTEM 
DESIRED  EXTRAS 
TOTAL  COST 
COMMENTS 


Tektronix  hard  copy-  hook-up  RS170  4  Child  Systms 

Soom/scroll ,  Prog,  element  generator, 
keyboard,  joystick 


$19,000 


$23,300 


Teklight  -  Tek  emulator  Prom 
Can  overlay  text 
Child  System 

S  6  S  electronics  dropped  Genisco  6  Ramtek  •  ADI  locks  good. 
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HijfileaHr  (MUM  Id  Mil  Bftlmm  MOO 


SYSTEM  r 

bexidata  MOO 

CONTACTS  Art  Frankie 

burlington,  Mil.  (417)272-2700 

MACS  MUMMY 

configurations 

AND  COSTS 

512  >  111  1 1  plant  Mu  14)  *1120 

1024  a  1024  a  1  plana  (IMa  4)  *44*0 

MOetHM 

ISM  ATTACHED) 

12  bit  Rlero  with  Ik- Ik  PROM  A  Ik  SAM  far  Writable  Control,  stora 
difficult  uaer  pcofrabniif  (2k-12k) 

muonY  uccm 

ONI  FQtIL  MO  AFTSR  NMTlAUIATtON  si  Na 

RSFRESH  RATE 
INTERLACE 

(512)  10  Ha  (512)  40  Na  (1024)  10  Ha 

2.1  1.1  2.1 

HOST 

INTERFACES 

FOR  HITS  rarn— nd  1*  bit  oarallel  DMA  lntarfaca  to  Unibui  *1200 

VAX  alnllar 

PERIPHERALS 

W/M222  joystick  $1400 

keyboard  $  too 

trackball  $2500 

tablet  $2000 

(dropped  lightpen  aupport) 

FACILITIES 

H-HARDWAM 

F-FlRMWARI 

vsoftwari 

IMiSER/PROCESSOR 

H 

« 

F 

loon  (lx,  2a,  lx,  16x)  and  Scroll  $1000 

for  •  bit  depth  254  x  24  (4  per  KM) 

Maxlnua  of  1024  x  24  aiaultanaoua  $2405 

•link  controller 

Multiple  overlay! 

DAM  add-on  for  oicro  $  500 

langa  DiapLay  operating  Syatan  -  acceaaible  via  Software  Driver 

which  ia  raaident  on  host. 

•vector  generation  each  vector  .  lOMa  +  2Ma  per  pixel 
-raxp  feature  for  color  look-up 
-aovie  feature  uaing  soon  and  scroll 

Cursor  $  700 

monitors 

5X2*  resolution  $  3 <000 

1024*  resolution  $8*000  to  $13,000 

NTSC  ENCODER 

optional  $  2,000 

HARO  CORY  FACILITY 

Tektronix  .hardcopy  $  7 , 500 

COST  OF  SASIC  SYSTEM 

0C8IRED  EXTRAS 

TOTAL  COST 

20 ,000 

Joystick,  keyboard,  tablet,  soon/scroll,  ram  add-on  ^$25,800 

GENISCO  DIGITAL  DISPLAY  SYSTEMS 


Setting  a  new  criteria  of 
modularity,  display  dynamics, 
performance,  reliability, 
processing  speed  and 
cost-effectiveness! 


Compietelv  programmable.  Gemsco  0>gi tai  Grapnic 
Display  Systems  are  modular ly  expandable  to  cover 
the  widest  range  of  application  requirements  You 
specify  the  features  and  options  you  need  Gemsco 
graphic  dispidv  experts,  using  functionally  proven 
‘building  block  modules,  tailor  systems  that  cost- 
effectively  answer  that  need  .  dynamically,  ef  ?  *  - 
CnJntly  a''d  reliably* 

Basic  3000  System,  Consists  of  tne  proper  CPU  in¬ 
terface  fast  entry  MGS/ RAM  Refresh  Memory  Mod¬ 
ules  -  with  read/write,  word  or  pit  capabilities, 
automatic  DMA  access  for  block  transfers  to  633k 
16-bit  words/second  -  and  the  Video  Control,  that 
generates  the  oasic  system  timing  and  formats  the 
output  for  RSI  70  waveforms 

Pre-Processor  Options.  The  GCT-3C1 1  Programmable 
Graphic  Processor.  under  control  of  -ts  Own  program 
that  is  easily  modified,  converts  data  that  ncluaes 


both  vectors  and  characters,  and  routes  <t  to  the 
memory  modules  A  Hardware  Character/ Vector 
Generator  i$  also  available  for  very  fast  dynamic 
applications. 

PottProceoor  Options.  Monitor  Control  Modules,  .n 
a  number  of  optional  configu  utions.  provide  added 
capability  to  the  system  such  as  Video  Gating.  C«r 
cumtnP"  .  Video  Lookup  and  Reaoback  Overlay. 
Cursor  Conttoi.  DAC  s.  RS232  I/O  Ports,  arn  RS'70 
composite  video  waveforms.  A  Scroll  and  Zoom  - 
by  image  or  piane  -  is  aiso  optionally  avanapie. 
Interactive  Device  Options.  RS232  compatible  inter¬ 
active  devices  like  an  ASCII  Alphanumeric  Keyboard 
with  16  lighted  function  switches.  Trackball  ana  Joy¬ 
stick.  and  an  1 1"  x  1 1“  Graphic  Data  Tablet  are  avail¬ 
able  from  Genisco. 

For  particulars  cn  your  specific  digital  graphic  display 
requirements,  contact  Genisco  -  a  njme  that  has 
stood  for  advanced  technology  over  the  oast  30  year: 
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SYSTEM 

Gani.co  GOT- 3000 

Stu  Robert  (UP)  Jo.  Tub lan  Bob  rr»y  (UPI 

CONTACTS  (stiiMn)  Out  Jon.i  Dav#  Paulay 

U5W.11,  M,|  (617)459-2578  Irvin.,  CA  _ 

IMAGE  MEMORY 

CONFIGURATIONS 

AND  COSTS 

512  v  512  *  1  plan.  $1500 

1024  x  1024  x  1  plan.  $2500 

Max  of  14  plan*,  in  2  chain* 

PROCESSOR 
(SEE  ATTACKED) 

jroqn— abl.  Graphic*  Proc.tior  (PGP) 

Graphic  Operating  sy.t.a  talt.a  about  1/2  of  th.  4k  RAM 

MEMORY  ACCESS 

ONK  PIXfL  MO  AFTIR  INITIAUZATION 

3 

REFRESH  RATE 

INTERLACE 

(512)  60  Preferred  40,  20  Ha  (1024)  20  Hi  j 

11  Y.a  2:1  Y.B  2:1  j 

HOST 

INTERFACES 

POP  11/70  (RSXU-N  availabl.)  Hot.:  Driver  v/dwoi  •  $560  $1700  r; 

VAX  $1700 

PERIPHERALS 

keyboard  $1350 
trackball  $2900 
joy.tick  $1000 
tabl.c  $1500 

C  \ 


h 

k.. 

%■ 


FACILITIES 

HHAROWARE 

FFIRMWARE 

S- SOFTWARE 
U-USER/PROCESSOR 

H 

H 

H 

H 

H 

Charact.r/vactor  generator  *vl0  Mi  per  pix.l 

Scroll  and  2ooa  (2x,  4x,  Sx) 

Color  Tabl.  256  x  24 

Fill  Nod.  -  will  fill  batwaan  vac tori  (aax  4  planei) 

Curior  and  blink  control  in  lieu  of  i.cond  curior 

$2000  j: 

$1500  !j 

inelua.d  j 

included  1 

includ.d  I i 

$ 

Graf  pac  IX  -  fortran  callabl.  graphic*  lubroutin.  library  I  ) 

include* :  aria  fill,  ion.  2d  trinilation* ,  j  ■ 

eurv.*,  lin.a,  vector*,  t.xt  control  $3000  •  ; 

li 
]  ! 

h 
:  1 
!  1 
’  3 

'  "j 

MONITORS 

512  x  512 

1024  x  1024 

$  3,240  5  3 

$15,200  ’ 

NTSC  ENCOOER 

$  4,500 

HARO  COPY  FACILITY 

i  3 
'  1 

COST  OF  SASIC  SYSTEM 

$20,700  j 

OESIRED  EXTRAS 

TOTAL  COST 

k.yboaid,  joy.tick,  tabl.t,  charactar/vactor  generator, 
xooa/acroll,  Graf  pac  II 

$31,050  j  j 

COMMENTS 

-vactora  auat  b*  .raa.d  for  aovaxant,  "ardwar.  ntioring  i  1 
-eharact.r  control*:  lx. ,.16x  xooa,  90*  rotation,  progranabl.  Xonti  ,  4 
-plot-10  aaulator  availabl.  i  1 
-ruaor.  of  hardware  problau  free  ua.ra  ,  j 

1 


BBfBrrpr- 


i  ii  uniinn  ni  «av.»tK<« 


DeAnsa  VC5000 


v  ‘ 

?•- 

f 

C 

IMAGE  MEMORY 

CONFIGURATIONS 

AN0  COSTS 

1  , 
t 

PROCESSOR 

MEMORY  ACCESS 

%  i 

REFRESH  RATE 

| 

INTERLACE 

HOST 

I  i 

INTERFACES 

PERIPHERALS 


■CONTACTS  Rick  Fixsa  Qiuck  Nordby 
San  Josd,  CA  (40S) 263-7155 


All  systana  in  512  x  512  x  16 

System  SI.  Monochroaaa  -  8  bits  intensity,  4  overlay,  4  sux. 
System  42.  Colot  12  bits  RGB  (4  par  gun) ,  4  overlay 


LSI-11  totally  ussr  prog  r  fab  la  (24K  bytes) 


POP  11/70  MX 


IfTTTT? 


Joystick  (cursor) 
ADM- 3  Dumb  Tsminal 


FACILITIES 

H-HAROWARE 

7-FIRMWARE 

S-SOFTWARE 

UUSER/PROCESSOR 


U  Color  Tablet  with  Iaa9a  Transform  Control  -  "?eudo  Color’ 

Monochrome  2048  x  8  4400 
Color  1024  x  12  (4  psr  color)  41,950 
Color  2048  x  12  (4  pur  color)  42,000 


H  Dual  Cursor  (different  inodes) 

H  Zoo*  (2x,  4x,  8x)  and  Scroll 
S  Vector  Generation  8.5  Me/pixel 
r  Character  Control  (41,000)  w/Color 


41,400 
included 
included 
4) ,500 


MONITORS 


NTSC  ENC00ER 


HARO  COPY  FACILITY 


COST  OF  SASIC  SYSTEM 
DESIRED  EXTRAS 
TOTAL  COST 


including  1024  x  12  color  table 
joystick,  terminal ,  cu-sor 


COMMENTS 

-Designed  to  stand  alone  -  termUial  and  floppy 
-Image  from  floppy  approx.  11  sec. 

IV. 8-37 

%fil.40O 

434,950 


Typical  Modal  5216  System  Configurations 


srAMO^iOM#  im*04  moctumo  mm# 


Machantead  Configuration 

The  Msow  UK  •»  eaeugad  m  •  if  n  ccown  9  men 
'•dtffdfmoginfoupeftfifif.  Ela*an  modMa  o*  wi 
cardtuMfrfO'OMMaprfcnMif  —  tom* 
nudum  raquua oniyhaMeardf  Ueiomiaa 
cnana  may  Ba  Mat  logMlwr  to  form  a  M  moduia 
conagmaiion  Eaen  cnaaaa  «  wl  oowaiaa  oy 
maana  ol  a  taat  mountad.  aan-eanlamad  aowai 
moduia  At  eomachona  art  mada  via  lx  aid- 
moxnad  eannacraia  Sida  awry  d  s*i  rin  mgoi.ia* 
anna  M  accata  to  ca'dadga  corwaciiona  a  •  wai 
at.  my  ovidiao  Oac»  otanai  woao  ma  cttaavait 
avtondad  on  ma  noa* 


aaawc  avaraw 


maoi  srtrin 


IV.  3-38 


*  / 


Half  Hubert  (Sales)  Cuatowr: 


SYSTEM 

Aydin  5216 

CONTACTS  (617)  6*9-6754  Hon  Helen  eon  use 

Nut.  (617)  649-7472  (how)  (617)  481-9511  Ext.  6419 

MUOE  MEMORY 

CONFIGURATIONS 

AMO  COSTS 

512  x  512  x  1  bit  plane  (no  aeaae)  $2000  to  $2200 

1024  x  1024  x  1  bit  plane  $2500  to  $2750 

HaxiMa  of  16  plane* 

PROCESSOR 
(SEE  ATTACHED) 

Intel  8086  -  up  to  1  Mag*  word  ucer  progra—able  using  Forth  language 

MEMORY  ACCESS 

ONE  PIXEL  VO  AFTER  INITIALIZATION  5  l  Ha 

REFRESH  RATE 

INTERLACE 

(256x256)  60  Kx  (256x256)  30  Hz  (512x512)  60  Kz  (512x512)  30  Hz  (1024x1024)  30  Hz 

ltl  2:1  1:1  2:1  2:1 

HOST 

INTERFACES 

POP  11/70  DMA  -  DRUB  interface  $  850 

VAX  Saw  $  850 

PERIPHERALS 

keyboard  with  10  function  key*  $  900 
lightpen  $  995 

45  fimction  keys  $835  90  function  key*  $1470 
joystick  $  690 
trackball  $2895 

FACILITIES 

H-HAMOWARE  r  t 

FIRMWARE 
^SOFTWARE 
IMJ8ER/PROCESSOR 

F  L 

H 

H 

H 

H 

K 

S 

Vector  c  Circle  Generator  (10*  firnamre  spaed?)  $3500 

team  Control  (2x,  4x,  8x,  16x)  and  Scroll  FQ 

Alphanumeric  Channel  Module 

Cursor  included  with  device  controller 

Color  Table  2048  x  12  (or  4096  x  6)  4  per  RGB 
-Additional  nodule e  to  provide  8  par  RGB  =  $3000 

AYCKAF  (4  different  versions)  SIGGRAPH/CORE  Version  S750C 

-polygon  fill  with  firwere 

-E-depth  sort  of  polygon  filled  areas  (hidden  surface) 

-curve  fitting  4  generation,  conics,  polar  coordinates 
-coloi  control  with  percent  of  hue,  intensity  (  saturation 
-2D  c  3D  Manipulation  and  windowing 

MONITORS 

(8024)  -  13"  diagonal  -  800  TV  line*  S3073 

(8025)  -  19"  diagonal  -  300  TV  line*  $3995  - 

(8026)  -  19*  oiagonal  -  1000  TV  line*  $7435 

NTSC  ENCODER 

HARO  CORY  FACILITY 

COST  OF  SASIC  SYSTEM 

DESIRED  EXTRAS 

TOTAL  COST 

with  8  fcii  far  RGB  color  table  $30,000 

keyboard,  lightpen,  joy*tiek,  vector  generator 

zooa  controller  w/»croll,  AIGRAF  CORE  Jfc$42,SOO 

COMMENTS 

upgrade  to  ultra-high  (1014x1024'  resolution  requires: 

-change  iom  PROM  chips,  Unaware  t  possibly  backplane 

IV. 3-33 


Joystick 


SYSTEM 

Norpak  vop 

CONTACTS  8111  Lalond 

Pakanha*  (Ottawa)  1-613-624-5507,  5570 

IMAGE  MEMORY 

CONFIGURATIONS 

AND  COSTS 

Two  baaic  conf igurationa 

newer i  512  a  512  a  4  for  512  or  1024  diaplaya  $3500 

oldan  1024  x  512  x  1  for  512  dlaplaya  only  %$17S0 

PROCESSOR 
(SEE  ATTACHEO) 

Sit  alica  micro-Fortran  calla  to  accaaa  micro- inatructiona 

Not  uaar  programmable  -  can  down  load  aoaiawhat 

MEMORY  ACCESS 

ONE  PIXEL  I/O  AFTER  INITIALIZATION  1.5  Ma 

REFRESH  RATE 

INTERLACE 

(512)  25,  30  Hz  SO,  60  Hz  (1021)  25,  30  Hz 

wmmmmsmmmmmmmmmmmmm 

HOST 

INTERFACES 

POP  11/70  Modified  DRUB  -  DMA  -  Limited  to  DEC  ( a )  $2800 

VAX 

PERIPHERALS 

keyboard  with  nuaeric/cureor  pad,  32  function  avitehaa,  8  lighta  $1300 
trackball  $3300 
joyatiek  $1900 
tablet  $2000 
touch  eaneitive  dlaplay  (future) 

FACILITIES 

H-HARDWARE 

F-FIRMWARE 

S-SOFTWAHE 

U-U3ER/PROCESSOR 

F 

H 

S 

S 

Firmware  included 

-point a,  vectora,  area,  polygona,  text  -1.5  Ma*  per  pixel 
-Scroll  (w/hardware)  -  each  bit  plane  aeparataly 
-ZOOM  2x,  4x,  8x  (w/hardwara) 

-can  uee  to  do  aubwindowa  on  acraen  (vvport) 

-Polygon  fill  and  area  fill 
-Run-length  encode  and  decode 
-Curaor  in  overlay 

Color  table  256  X  24  diaplay  and  256  x  4  overlay  lncl. 

Fortran  callable  routinea  to  accaaa  micro- inatructiona  $  350 

S1GGRAPH  CORE  -  not  complete,  waiting  on  SIGGRAPH 

MONITORS 

Racostaand  Conracv 

NTSC  ENCOOER 

HARO  COPY  FACILITY 

Micro-controlled  interactive  input  t  Ra232  output  tinware  drive  $3500 

COST  OF  SASIC  SYSTEM 

DESiREO  EXTRAS 

TOTAL  COST 

-$35,000 

keyboard,  joyatick,  tablet,  Fortran  interface, 

input/output  drive  (need  for  peripherala)  -$44,000 

COMMENTS 

Syetea  ia  not  really  completed  aa  yet,  hard  to  pin  down. 

IV. 3-41 


/ 


SYSTEM  (CONTACTS  Rick  Pint  Chuck  Rordby 

DaAnsa  IPSOOO  I  Sen  Jos4,  CA  (406)  263-715S 


IMAGE  MEMORY 

CONFIGURATIONS 
AND  COSTS 


Ml  tysteas  irt  essentially  512  x  $12  X  24  bit* 

system  II  2  channels  (1  icrxtch )  aonochrosw  6  bits 
syttaa  12  ]  channels  RGB  8  bit*  per  color  gun 

Syttaa  43  4  channels  (1  scratch)  N28  w/3  overlay  plane* 


PROCESSOR 
(SEE  ATTACHED) 


Pipe-line  Array  Processor  -  user  progr—abls 


ONE  PIXEL  I/O  AFTER  INITtAUZATtON  8QQ 


REFRESH  RATE 
INTERLACE 


POP  11/70  Treated  as  virtual  ■smnry-off  UNIBUS  by  us*  of  registers 


joystick  w/intarfacs 
trackball  w/ in ter face 
lightpen  w/interface 


FACILITIES 

HHAROWARE 

F-FIRMWARE 

S-SOFTWARE 

U-USER/PROCESSOfl 


Vector  generator  -  2.5  Ms  (sstiaata)  per  pixel 
H  Zoos  and  scroll  (sooa  2x,  4x,  8x) 

H  Color  Hap*  3  2S6  x.  24  aaps  -  display  only  one 

-ITU  -  Image  Translator  -  secondary  color  control 
K  Cursor 


HARO  COPY  FACILITY 


COST  OF  SASIC  SYSTEM 
DESIRED  EXTRAS 
TOTAL  COST 


COMMENTS 


laag*  processing  functions  ...  for  example t 
-Ctn  add  two  512  x  512  x  B  images  in  1/30  ssc. 
-aultiply  two  512  x  512  x  8  images  in  3/10  tec. 
-can  split  screen  with  separate  look-up  tables 

Alphanuasric  overlay  generator 


joy-tick,  lightpen,  ITU,  alpha/nua  generator  overlay 


>4^,000 

448,025 
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Tyt  cel  System  fl'oc*  Diagram 


Modssow 

Th#  iKONAS  processor  is  fully  user  progrimmabW  A  f#*t  32  bit  w*d*  architecture  give* 
unparalleled  precision  (or  graphics  and  imago  processing  sppltcetions  Th#  ihONAS  proceeaoc 
speeds  image  computation  by  tiecutmr  many  repetitive.  dm#  consuming  calculations  from 
microcode  Program*  Graphics  ano  imag#  processng  performance  is  'urther  annsncac  by  allowing 
tha  host  computer  direct  accass  lo  th#  image  memo’v  as  well  aa  to  ary  oth#r  wsmoiy  on  tn# 
IKONAS  bus  |cokx  too* -up  tao>#.  microcode  alor#.  ate) 


MUCK  mcmomv 

IKON  AS  imag#  M#mory  is  Ort  (Wan#  organized  Each  module  can  b#  addressed  a*  lQ24*Sl?*1. 
512*512*2.  or.  tor  muth-piaei  acc#ss.  as  1QK02-  Pan.  and  scroll  hi  pn#<  increments  a<)  Standard  as 
is  room  to  any  integer  ratio  1  1  to  256  ’  Modular  nature  of  th#  units  j  low  mam  or/  to  b«  #as*ly 
expended  from  Sl2*5l?*2  up  to  Sl2xSl2*32  or  1024*1024*16. 


PAST  MASOWAMC  NUCTtM.HA 

Tha  multiplier  accumulator  module  Facilitates  iff#  rapid  execution  of  many  graphics  and  image 
processing  tasks  which  raqjir#  mv..ipiy  than  add  or  subtract  uyces.  a  g  main*  multiplication  (3*0 
pomt  transformation),  vector  dot  and  cross  product  (shod#  calculations),  snd  weighted  averaging 
(SMt-aiiAvng)  Four  moduieo  orating  m  paratfol  allow  sub-miero?econd  3-0  pom.  transformation 


VIOtO  INPUT 

Video  aignait  may  be  written  into  th#  tm#g#  memory  in  r#«.  lim#  Th#  rugn  s pee*  but  archit#ctur# 
of  th#  iKONAS  system  snows  simultaneous  1  ?  Mbyt*s*c  v>d#o  input,  10  Mbyiwsec  v<d#o  output, 
and  2  Mbyt#r*#c  host  data  transfer 


ANIMATION 

Computer  graphics  animation  is  a  fast  developing  field  wit*  applications  n:  physical  system 
modeling.  o>spuiy  of  time  varying  data.  «no  cartooning  ;*VNAS  system*  support  computer 
emmation  using  color-map  or  n.viength  encoding  (ecnntgues  with  a  variety  of  color  iook-up  tabier 
ano  rurvangth  oecodem  ’  mag#  Memory  serves  at  a  ryn.ieogth  animation  buffer  tor  encoooo 
•mages  as  wall  aa  frame  butter  for  unencooeJ  images.  The  Mats  image  Storage  module  can  hold  up 
to  60  seconds  or  mo'.'w'vtety  compie*  animation  tor  real-time  piaybac*  or  can  be  used  io  store 
unoncod*-  -iagw* 


rUKMIUTY.  CXPANOAtILlTV 

iKONAS  systems  •<»  ertbrety  modular  being  configured  from  various  module*  attached  to  a 
common  communication  bus  System*  are  easily  eapanoed  On#  cage  holds  20  card*,  multiple  cage 
configurations  a  e  possible  a  user  can  begm  with  a  simple  (ram#  buffer  ana  add  processor,  -mage 
input,  and  hardware  multiplier  module*  'star 


CUtTOMIZtO  ftVtTIMi 

Modular  det'?n  ot  components  means  mat  systems  are  configured  to  meet  a  customer's  oa/ticu'sr 
neeos  Eatensiv#  use  ot  microprogrammed  controller*  m  the  modules  means  mat  custom 
modification?,  are  aasiiy  performed  >n  many  cases  A  wide  variety  of  options  >t  available  'KOWaS  <s 
oanicuiariy  .merest  td  >n  providing  srste-of-the-an  hardware  fur  research  ano  special  puroose 
graphics  ar.d  image  processing  systems 
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STtTtM 


MAOIHmOftT 


■  CONTACTS  Muv  Mhlttaa  or  Nick  saqlend 
I  belaigh,  MO  (919)  S13-S401 


CONFIGURATIONS 
ANO  COSTS 


1014  tUlll 
Mu  of  20  cords 


processor 

(SIS  ATTACK  CD) 


REFRESH  RATE 

INTERLACE 

HOST 

INTERFACES 

PERIPHERALS 


Aig  ood  tut  32  bit  200  Mo  Microprocessor 
(aistet  SO*  cost  of  tbs  totsl  systea) 

ONE  POCEL  MO  AFTER  IMTUUZAT3QH  iVl 

100  Msec  an  bos  (1  cycle)  ...  4  cards  con  operate  in  sit.  100  ns 
(S12)  30  Hs  (S12)  60  H«  (512)  JO  Hr  (512)  100  Rs  (1024)  30  Hr 

1:2  111  lil  _  oTT 

POP  IlffO  QMS  rig  DM119 

VAX  i«se _ 

none  ss  yet  -  peripherals  are  hung  off  the  host 


facilities 

H-HAADWAAE 

E-FIRMWARE 

^SOFTWARE 

U-USERIPRQCESSOR 


rl(Pan)  Scroll  4  Zoom  (Is,  2s,  3s,  4s,  5s, 


H  Run-**  Enccde/Decode  . 
r  Min  sir  nd  Viewport  ing 
H  Curur 

H  Color  Table  1024  s  24 


animation  possible 


M  color  Table  1024  s  24  tow  Speed:  <2000  High  Speed:  S2800 

(Required  for  1024  *  1024) 

0  Possible  things  to  look  for: 

-fast  vector  generation  (NASA) 

-■real  tine"  hidden-line/surface  (NASA) 

-2D  s  3D  aodel  aanipulation 
-Edge  detection  end  anti-aliasing 


NTSC  ENCODER 


HARD  COPY  FACILITY 


COST  OF  BASIC  SYSTEM 
DESIRED  EXTRAS 
TOTAL  COST 


COMMENTS 


■Look  for  good  things  froa  Ikonas  ..."  (AED  snginaer) 

-advertised  ss  tooi  for  graphics  research 
-no  high-level  language  interaction 

-upgrade  to  ultra-high  rtsolution  requires  one  software  bit  change! 


IV.  3-45 


The  following  paragraphs  briefly  describe  the  various 
elements  of  the  RM  9400  Display  Generator 


FIQURI  1.  RM.M0Q  FUNCTIONAL  iLOCK  DIAGRAM 

•  Computer  interface  (RM-9000-XX) 

The  Computer  interface  prcvices  a  high-speed  imn  between 
me  nost  computer  ana  me  RM-9400  Duplay  Generator  A 
general  oppose  interface  GPtF)  is  provided  on  the  Dismay 
Processor  Two  additional  card  siois  are  reserved  tor  custom 
nieriuces  Ott*Jhe-$neit  .menaces  are  ava  iaoie  tor  most 
mmicomouters  anc  some  targe  mam^ames  An  mterfaces  are 
’6-O't  oaraiiei  Most  -ncoroorate  or  ufi‘t*e  direct  memory 
access 

•  DMA  Sequencer 

T*»e  T T\,  DMA  Sequencer  performs  nigh-speeo  non. 
processor  transfers  •"voiving  myii’pie  devices  on  me  System 
Bus  *or  example  jet  ween  me  Computer  interface  and 
Display  Processor  or  Vemorv  Conroi  Processor  Tne  DMA 
Sequence'  can  .nvoive  as  many  as  14  ports  and  seven 
suo:oods 

•  Duplay  Processor 

Tne  D'Scav  Processor  direct1/  or  indirectly  control  eacn 
e'er’neni  o'  me  cso-ay  system  :n  aacit-on  -t  decodes 
rece  ved  nstruct-ons  stores  suPO'Ctures  1  command  lists <  and 
‘onts  performs  coordinate  t'ansformat  ons  and  dnves  tne 
Memory  Control  Processor  Tne  Display  Processor  contains  a 
Z8Q  microorocessor  wn  32tf  oytes  eacn  ot  EPROM  ana 
Ram  3  GP  C  ntertace  three  senai  ports  a  t  mer  memory 
mao  cvcte- ste3"ng  DMA  ana  nterr-ot  ccmt'O'  ogic  The 
memory  map  accommodates  JO  fO  5'  2*  memory  CyteS  O’ 
wn-rh  96K  oytes  are  '#$fryed  *o <  -memai  control  software 

•  Processor  Expansion  Modu'e  (RM-9400-PEM1  2  3  4) 
The  Processor  E«oans  on  Mocuie  aods  a  nign.soeed  mat'’ 

M.n-1  jo  to  32k  ovtes  EPROM  ootert-ai  ana  ac:a«tiona-  user 
RA.m  'o  :ne  0'Spiav  processor  Memory  exoans  on  mav  oe 
specked  n  32k  oyte  "C'emeors  to  a  maximum  o»  '28k 
oytes  ■  wnere  *  *  numper  ot  32K  ovte  nc»ements: 

- Memory  Control  Processor 

T"e  Memorv  C  jn'-rji  processor  draws  ;y-m.t.ves  a'pna- 
numencs  g-aon'cs  maqes  etc  ■  me  '“■e  retre$n  memory 
an d  per'orms  •;«  oping  ent.tv  detect  cn  oan  ana  zoom  ”•'*? 
MCP  comams  a  soeoai-ouroose  1 6-O't  piooiar  mcro- 
processor  wth  aed  catea  Q0M  RAM  ana  support  'ogc 


•  Refresh  Memory  (RM-Y  X  2) 

The  Retresn  Memory  consists  of  soiid-stefeMOS  RaM  s  mat 
store  the  picture* $)  m  raster  scan  dot  mam*  formal  The 
memory  >*  organized  as  one  to  eignt  groups  of  uo  to  i 6-pus 
teen  Each  i6*t>t  ceil  defines  a  smgie  pixel  on  one  or  more 
CRT  s  Taote  t  <ists  me  ooss'oi*  resolutions  aspect  ratios, 
and  refresh  frequences 
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•  Serial  Lmk/Curfor  Option  (RM-9400*$LC2'4) 

The  Senai  i.mk  Cursor  option  processes  operator  input  from 
keyboards  and  grapnic  input  devices,  and  generates  two  or 
tour  independent  cursors  mat  can  pe  used  to  pomi  to  me 
tace  of  tne  display  without  affecting  the  data  m  refresh 
memory  The  RM-9400-SLC  consists  of  a  280  micro¬ 
processor  with  dedicated  ROM  and  RAM.  four  or  eight  senai 
ports  and  two  or  four  32  x  32  programmable  cursor 
generators  Support  software  is  avaitaoie  for  Keyooards. 
ioysticKs  tracKoans  ;ignt  pens  and  graphic  tablets 

•  Video  Generator  (RM-9400-Vn) 

Tne  Video  Generator  transforms  the  stored  0‘Ctures  mto 
mdustry  compatible  video  s-gnais  mat  drive  Ramtek  or  ome? 
commercially  avaiiaoie  high  resolution  CRT  monitors  large 
screen  proiectors  and  hardcopy  printers  Ail  outputs  are 
comoatioi*  with  Ea  Sic  RS- 1 70  or  RS-343-A  specifications 
for  composite  vioeo 

The  viquo  generators  process  data  on  a  pixei-oy-o-xei  oasis 
through  PROM  or  ^AM  detmea  lookup  tables  that  assign 
output  coior  and  or  ntensitv  Each  pi*ei  -noexes  tne  iookud 
uoie  as  -t  $  scanned  trom  me  refresn  memory  The  contents 
of  tne  addressed  ce'i  n  tne  -ookud  table  are  men  passed  'o 
the  cfo-n  -nog  converters  (DACi  or  v>oeo  ampuf-e's  that 
croc..  out  video  s  gnais 

Cursor  anc  Ovef[ay  m  x.ng  s  Performed  either  n  the  >ookup 
taoie  or  at  the  DAQ  o\  ciampmg  me  output  voltage  to 
minimum  o'  max-mpm  sca<e  Ah  ,  ipeo  generators  ncoroorate 
a  pnnx  trequenc.  generator  mat  aiiows  se'bct  ve  punh 

There  are  three  off-tne-sneit  vioeo  generators  mat  satisfy 
most  aooiicat'ons 

a  Tne  T  ype  '  Video  Generator .  RM-9400-v  t )  s  designed  'o' 
general  qraon'cs  appiicat  ons  The  RM.940C-V1  drives  - 2 
two-pit  id-  even  v-aeo  OjiPutS  to  t  2  monochrome  or  'Our 
RC>B  coior  C'Sp'ays  m.  aaciticn  tne  RM-9400-vt  provides 
nardvvare  ptihH  and  mixes  up  *0  ‘Our  naepencent  cursors 
Af'h  any  ot  tne  '2  output  channe'S  Coior  ntensity 
ovenav  and  O'""*!  ass-gnment  are  accomplished  oy  opOM 
CCdmg  Any  Ol  b4  colors  Or  ‘our  -ntensihes  may  pe 
soectiea 


•  / 
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SYSTEM 

Raatek  9400 

CONTACTS  Kan  Mullany 

Boston,  Mams.  (617)862-7720  Sunnyvale,  CA  (408)735-4800 

IMAOE  MEMORY 

CONFIGURATIONS 

AND  COSTS 

812  X  812  x  1  plans  $  459 

1024  x  1024  x  1  plana  $2345 

Maximum  of  8  plsnaa  par  chassis  1 

(add  $2000  for  larger  chassis) 

FROCESSOR 
(SEE  ATTACHED) 

Z-80  traffic  eontrollar  ...  not  recoenended  for  user  progressing 

MEMORY  ACCESS 

ONE  PIXEL  VO  AFTER  INITIALIZATION  1.12  Ma 

REFRESH  RATE 

INTERLACE 

(812)  28.  30  Hs  80,  60  Hx  (1024)  25,  30  Hr 

2(1  111  2:1 

HOST 

INTERFACES 

PDF  11/70  DftllB  s  c  (will  also  quota  Mass  bus  in  futura)  $2200 

VAX  Similar 

PERIPHERALS 

keyboard  sisoo 
tablet  $2000 
joystick  $1400 
lightpan  $2900 
trackball  $3000 

FACILITIES 

H-HARDWARE 

F-FIRMWARE 

S-SOFTWARE 

U-IISERfPROCESSOR 

(S) 

<s> 

(S) 

(S) 

H 

H 

H 

S 

F 

F 

H 

H 

H 

H 

Vector  Generator  •  16.000  Veetors/aec  (50  pixels/vector) 

Zoom  (2x,  3x,  4x,  Sx  ...  16x)  and  scroll 

Color  table  (2)  102  x  16:4  per  RGB  and  4  monochrome  hard  copy 

-  Second  video  board  for  8  bits  par  RGB  $6120 

Arcs,  Circle  fill  and  Polygonfill  s  500 

area  fill 

2D  translation,  rotation  and  scaling  $  500 

Viewport ing 

Decluttering  -  increased  detail  with  zoom 

Sjtity  detect  into  display  list  and  return  to  host 

Down-load  display  list  board  w/list  processing  $3230 

-gives  power  to  directly  address  image  data  commands 

MONITORS 

512  -  60  H*  $3970  1024  -  30  Hs  $10,200 

NTSC  ENCODER 

HARO  COPY  FACILITY 

Poloroid  system  $12,000  38  am  earners  $  adapter  $4000 

COST  OF  SASIC  SYSTEM 

DESIRED  EXTRAS 

TOTAL  COST 

with  6  bits  par  RSI  in  color  table  $41,100 

keyboard,  tablet,  joystick,  lightpan,  polygon  filler, 

2D  transformations,  download  display  list  processor  $53,100 

COMMENTS 

-Maintenance  and  training  course  -  4  weeks  in  CA  $  1,000 

-Upgrade  to  1024  requires  sync  and  backplane  change, 

and  possibly  return  to  shop  *$  2,000 

IV.  3-47 


Vision  Orte/20 


Q  Displayed  spatial  revolutions  ot  512*512  or  1024*1024. 
guaranteed  to  mifi  rv  fiend  test  mecdiealions  defined  m 
IJuontnatrve  t v jlojiwt  oi  Son  Copy  Displays 

•  Up  to  512  'wparjte  bus  can  represent  each  J"d  every  picture 
element  in  j  51 2  *  SI  2  disp^v  presentation  t12B  Nts  per  picture 
elements  m  4  1024*1024  ditplavt. 

•  up  l»»  l  M.IMU.000  h*l4  ol  iRUNf  refresh  data  base  memory 
.ixjiliiliH1  m  ,i  single  9-ioot  electoral  c aNnrt 

•  {vw\  one  tit  i hr  1 14  ooo  000  bih  ol  the  refresh  memory  ,w  read 
(Hil  Inf  display  in  1/90  Ol  4  second 

•  Enough  image  refresh  data  ha*e  memory  is  available  i«  allow  .i 
t  omplefr  4099  *  4049  *  9  htl  arrav  lo  hr  viesved  m  real-time 

•  Am  one  hi*  oi  thr  refresh  memory  can  br  randomly  addressed  and 
re.gl  out  m  900  njno^cond' 

•  Dynamic  .iwtmmfni  ol  ima*e  mrmorv  tor  n»hr»  thr 
representation  ol  brightness  mciemenh.  oi  an  image  nr  oor  htr  rttx 
map  overlays 

•  image  refresh  mrmorv  mav  br  airtveri  m  spatial  coniiguraiHwu  ix 
512*  )27M  or  52.799*512  picture  elements  or  an*  other 
configuration  in  4  52.799*  12.799  space 

•  Refresh  mrmorv  configuration  assignment  completely  dvnjnm 
and  entirely  undrr  tionware  control 

•  Full  color,  high  r  delay  co*or  pirtmiahon  tup  to  2«*  brightness 
I rvrU  irfMgr  refresh  4nd  display)  available  in  all  ol  thr  spatial 
conjuration  mentioned  above 

•  Full  $12*512  resolution  real-urn*  roam  oi  a  large  data  haw*  is 
provided  in  4  moving  window  presentation  wxh  no  restrictions  on 
thr  direction  nr  rair  01  movement  01  thr  window  presentation 
across  thr  utteih  mrmorv  data  base 

•  Zoomed  patent  of  *>n  <y  any  259*259  or  129*129  ptrturr 
rlrmrni  arra  ot  (hr  rmr*h  mrmorv  data  base  with  tyll  window 
cmmImIiu  a*  n 
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•  loop  movie  prewMfiatinns  ix  up  to  94  separate  512*512  spatial 
resolution  tramns  25h  srpjr.itr  trames  at  259  *  259  spjn.it 
resolution  and  1024  *r(Mrair  names  at  1 28*  129  spatial  resolution 
Each  irame  mas  have  up  to  259  brightness  levels 

•  Completely  >ndepeodent  u>a*  ot  the  retresh  mrmorv  data  base  by 
up  to  4  u'wr-  Each  u>er  supplied  with  separate  kev hoard  control 
and  mtteprrxkmt  lull  mtor  video  output 

•  Allocation  ot  portions  ot  thr  refresh  memory  data  Ihee 
dvnamicjllv  assignable  between  users 

•  Retresh  memory  data  has*  data  is  loaded  completely  independent 
ot  display  presentation  idual  ponrd  Consltucnonr 

•  Complete  random  addressability  to  a  single  p*ctu»e  element 

•  Automatic  Nock  transfer  or  image  data  provided  with  the  abddv  to 
Ifxirt  sequentially  trom  either  side  to  either  side  or  top  to  bottom  or 
Itnitom  to  top 

•  freeze  irame  lunsier  nt  image  data  synchronously  into  image 
memory  at  rates  01  1/10  ot  a  second 

•  Real-time  rewriting  N  the  retresh  meroorv  data  base  on  the  Iwsis 
01  processing  algorithms  m  the  output  section  im  the  displ.n 

•  Herat  1  w  re- pox  essing  ot  the  refresh  sgwed  dau  through  system 
contained  processing  algorithms  performing  true  "pipeline 

I  woe  essmg  nt  ihc«  refresh  stored  data  with  each  pro*  essing  step 
taking  P  lit  ix  a  second 

•  Image  comlxmng  capability  on  tN1  oasis  ol  plus  minus.  mulh|ds 
ami  disuk‘ 

•  Real-time  IX. h  k  and  white  01  lull  uXtx  image  ((imposition — 
allowing  the  ruxs-desthn  use  super  imposition  iX  regular  or 
irregular  <Jvtped  portions  iu  images  one  upon  the  other  with 
congdefe  trvedom  ix  nondestructive  tranvIatHm  of  the  super 
■nggisition  sax  imn  m  any  direction  The  twajlting  composition  can 
he  msiands  u*ed  to  iofn>  and  store  an  entirety  new  image 


SYSTEM  CONTACTS  itarvay  Raidar 

COMTMi  Vi. non  Cnt/2C  (212)  797-117$ 

IMAGE  MEMORY 


Faiadana,  GA 


CONFIGURATIONS 

AND  COSTS 

incremental  by  512  x  512  x  8  bit  image  groups  $8,000  to  $10,000 

maximum  of  64  groups  *  512  bits  par  pixel 

PROCESSOR 
(SEE  ATTACHED) 

Pipeline  processor  to  recompute  all  picture  alamants 

LSI  micro  to  handle  usar  intaraction,  systam  raaponaa,  memory  management 

MEMORY  ACCESS 

ONE  PIXEL  I/O  AFTER  INITIALIZATION  M  ^  (r„d  900  Nf) 

REFRESH  RATE 

INTERLACE 

30  Hi  60  Hi 

1:1 

HOST 

INTERFACES 

POP  11(70  RSX11-N  is  availabla.  Unibus  board  t  DRllB  53150 

VAX  Similar 

PERIPHERALS 


keyboard 
trackball 
data  tablat 

magnatic  tapa  transport 
floppy  disk 


FACILITIES 

H-HAROWAAE 

F-FIRMWARE 

S-SOFTWARE 

UUSEWPROCESSOR 


High  powered  image  processing  facilitias  -  upper  ranga 
-can  roam  a  <096  x  4C96  x  8  bit  array 
-134,000,000  bits  of  rafrash  memory  raad  in  1/60  sac. 
-animation  of  64  512  x  512  x  8  imagas  in  msmory 
•raal  tiaa  (1/30  sac)  imaga  procasaing  faaturas 


(list  is  availabla) 
+ 


monitors 


IS'"  high  quality  monitor  included 


NTSC  ENCODER 

HARD  COPY  FACILITY 

COST  OF  SASIC  SYSTEM 

DESIRED  EXTRAS 

Pricas  from  approx.  $40,000  to  $700,000 

approx.  $40,000 

TOTAL  COST 
COMMENTS 


-computer  built  into  systam 
-top-of-tha-lina  for  imaga  procasaing 


Bulldln*  37,  ,'oom  509 
September  26,  1979 


To:  V.H  lucke,  Design  Graphics  Personnel,  J.F.  Berkery,  W.E-  Lorensen, 
J.L.  Mundy,  R.B.  Seltzmen 

Subject:  Updtte  #1 

Survey  of  Color  Video  Fra**  Buffer  Systems 


1.  In  a  resent  telephone  conversation  with  Ken  Anderson,  of  the  Anderson 
Report,  1  found  that  no  further  information  had  been  uncovered  regarding 
Seiko's  (Tokyo)  digital  TV  display.  Rumor  had  It  that  Seiko  had  utilized 
a  4  x  4  transformation  matrix  similar  to  the  Evans  and  Sutherland  Picture 
System  II  with  a  512  x  512  full  color  frame  buffer  system. 

2.  Mr.  Anderson  did  give  me  a  name  to  ccntact  at  DEC  In  Nashua  Mew  Hampshire 
regarding  their  rumored  512  x  512  color  video  frame  buffer  system.  DEC’S 
System  Processor  Is  based  on  2901  architecture  utilizing  a  160  Ns  cycle 
time.  The  system  will  act  as  a  device  on  the  Unibus  with  a  parallel  Interface. 
The  Initial  system  will  be  able  to  draw  50,000  vectors  (short  or  Inch?)  per 
second,  but  very  few  other  facilities  will  be  offered  and  the  processor  will 
not  be  user  programable.  The  Initial  system  will  offer  a  maximum  of  4  512  x 
512  image  memory  planes,  with  a  19  Inch  color  monitor  and  interface  for  approx¬ 
imately  $14,000.  It  will  be  available  for  shipment  around  June  1980  and 
development  will  continue  to  Improve  the  system. 


Peter  Atherton 

37-509 

8-169? 
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MISSION 

of 

Rome  Air  Development  Center 

RA VC  plana  and  execute*  research,  development ,  test  and 
selected  acquisition  programs  in  support  of.  Command,  Control 
Communications  and  Intelligence  (C3l)  activities.  Technical 
and  engineering  support  mtkin  areas  o{  technical  competence 
is  provided  to  ESP  Program  Offices  (PCs)  and  ether  ESP 
elements.  The.  principal  technical  mission  areas  me 
communications,  electromagnetic  guidance  and  control,  sur¬ 
veillance  of  ground  and  aerospace  objects,  intelligence  data 
collection  and  handling,  information  system  technology, 
ionospheric  propagation,  solid  state  sciences,  microwave 
physics  and  electronic  reliability,  maintainability  and 
compatibility. 
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